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"SOLUBLE EPIDERMAL GROWTH FACTOR RECEPTOR-LIKE PROTEINS AND THEIR 
. USES IN CANCER DETECTION METHODS" 

This application claims priority from Udr^pplication No. 60/157,144, filed September 
5 30, 1999, entitled "SOLUBLE EPIDERMAL GROWTH FACTOR RECEPTOR-LIKE 
PROTEINS AND THEIR USES". 

The disclosed invention was made with the support of grants from the National Institutes 
of Health grants 1R01 CA57534 (Truncated c-erbB Receptors in Women with Ovarian Cancer) 
to Nita J. Maihle, CA09441-13 (Multidisciplinary Basic Research Training in Cancer) to the 
P; 10 Mayo Foundation, and 1K07 CA76170-01A1 (Soluble ErbBl molecules as Tumor Biomarkers) 
F; : to Andre T. Baron. The U.S. Government has certain rights in the invention. 
% FIELD OF THE INVENTION 

q 1 5 The present invention relates to the discovery of soluble isoforms of an Epidermal 

fU Growth Factor Receptor, or sErbBl variants, the nucleic acid sequences encoding these 

C isoforms, purified recombinant proteins, novel antibodies specific for these isoforms, and the use 

of immunoassay and other protein assay techniques to measure the concentration of these protein 
isoforms in a patient biological sample in the femtomolar range. The present invention also 
20 provides diagnostic methods for determining the presence of an ovarian carcinoma in the patient 
by assaying the concentration of soluble ErbBl variants in a biological sample from a patient. 



BACKGROUND OF THE INVENTION 
The epidermal growth factor receptor (EGFR) is a transmembrane glycoprotein encoded 
25 by the c-erbBl proto-oncogene. Sequence analysis demonstrated that the human EGFR is the 
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cellular homolog of the v-erbB oncogene from the avian erythroblastosis retrovirus (Downwald 
et aL, Nature, 307, 521 (1984); Ullrich et aL, Nature, 309, 418 (1984)). In recent years, a series 
of c-erbB related cell surface receptor tyrosine kinases has been identified. The four members of 
the c-erbB proto-oncogene family are: ErbBl/B^R, ErbB2/Neu (Coussens et aL, Science, 230, 
5 1 132 (1988)); ErbB3 (Kraus et aL, Proc. Nat'l. Acad. Sci. USA, 86, 9193 (1989)); Plowman, et 
aL, Proc. Nat'l. Acad. Sci. USA 87, 4905 (1991)); and ErbB4 (Plowman et aL, Proc. Nat'l. Acad. 
Sci. USA, 20,1746 (1993)). Distinct genes encode these receptors. 

The ErbBl/EGFR includes three functional domains: an extracellular ligand binding 
domain, a transmembrane domain, and a cytoplasmic tyrosine kinase domain. The extracellular 

10 domain can be further divided into four subdomains (I-IV), including two cysteine-rich regions 
(II and IV) and two flanking regions (I and III) (Maihle et aL, Proc. Nat'l Acad. Sci., 88, 1825 
(1991)). Subdomains I and III are involved in ligand binding. Ligand binding to the receptor 
represents the first event in a complex cascade culminating in DNA synthesis and cell division. 
The full-length 170 kD human EGFR is encoded by two alternatively spliced transcripts of 5.8 

1 5 and 1 0.5 kb (Ullrich et aL , supra). A truncated 80 kD EGFR present in human placenta extracts 
may be encoded by an alternatively spliced 1.8 kb RNA transcript (Ilekis et aL, Mol. Reprod. 
Devel., 41, 149 (1995)). Furthermore, soluble EGF receptors arise from aberrant transcription 
products in carcinoma derived cell lines, as exemplified by the epidermoid carcinoma line, A431 
(Ullrich et aL, supra). In this cell line, the EGFR gene is amplified and rearranged, and a 2.8 kb 

20 transcript arises from a translocation between the 5 f - region of the EGFR gene and an 

unidentified region of genomic DNA (Ullrich et aL, supra; Merlino et aL, Mol. Cell. Biol., 5, 
1722 (1985); Hunts et aL, Cell Mol. Genet., 1 1, 477 (1988)). Alternatively spliced c-erbBl RNA 
transcripts of approximately 2.6-2.7 kb that encode secreted, truncated receptors containing only 
the extracellular ligand binding domain are also found in normal chicken and rat tissue (Maihle 

25 et aL, supra; Petch et aL, Mol. Cell. Biol, 10, 2973 (1990)). 
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Soluble forms of the ErbB family of receptors have been investigated in connection with 
several cancers (McKenzie, Biochim. Biophys. Acta, 1072, 193 (1991); Brandt-Rauf, Mutat. 
Res., 333, 203 (1995)). Immunoassay studies show that sErbB2 proteins are elevated in serum 
samples of patients with breast and ovarian cancer (Mori et aL, Jpn. J. Cancer. Res., 81, 489 
5 (1990); Meden et aL, Anticancer Res., 17, 757 (1997)). Recent studies suggest that low 

pretreatment serum sErbB2 levels are positive predictors of responsiveness to hormonal therapy 
for patients with metastatic breast cancer (Hayes et aL, Breast Cancer Treat., 14, 135, (1993); 
Leitzel et aL, J. Clin. Oncol., 13, 1129 (1995); Yamauchi et aL, J. Clin. Oncol, 15, 2518 (1997)). 
Meden et aL {supra) have reported a positive association between elevated serum pi 05 sErbB2 

10 levels and shorter survival for patients with stage I through IV epithelial ovarian cancer (EOC). 
In addition, U.S. Patent No. 5,674,763, issued October 7, 1997, described the use of antibodies 
against the external EGF binding domain of EGF receptors to diagnose neoplastic diseases 
correlated with an increase in the level of an EGF receptor in a patient's blood. 

Immunoassay studies show that the extracellular domain of ErbB 1 is detectable and 

15 increased in the serum of patients with asbestosis-induced lung cancer (Partanen et aL, J. Occup. 
Med., 36, 1324 (1994); Partanen et aL, Int. J. Oncol., 4, 1025 (1994)) and in the urine of patients 
with squamous cell carcinomas of the head, neck, and lung (Witters et aL, Clin. Cancer Res., 1, 
551 (1995)). Ilekis et aL (Gynecol. Oncol., 65, 36 (1997)) have recently observed a positive 
association between levels of a p80 sErbBl protein and full-length ErbBl in tissue samples of 

20 serous cystadenocarcinomas of the ovary. In addition, ErbB 1 receptors have been shown to be 
over-expressed in various human tumor cell lines and neoplasms (Xu et aL, Proc. Nat'l. Acad. 
Sci, 81, 7308 (1984); Salomons aL, Crit. Rev. Oncol. Hematol, 19, 183 (1995)), including 
cancers of the breast (King et aL, Science, 229, 974 (1985)), lung (Hendler et aL, Proc. Am. Soc. 
Clin. Oncol, 8, 223 (1989); Veale et aL, Cancer Res., 49, 1313 (1989)), brain (Schlegel et aL, 

25 Int. J. Cancer, 56, 72 (1994)), bladder (Neal et aL, Cancer, 65, 1619 (1990); Mellon et aL, J. 
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Urol., 153, 919 (1995)), and ovary (Berchuck et al, Am. J. Obstet. Gynecol, 164, 669 (1991); 
Scambia et al, J. Clin. Oncol., 10, 529 (1992)). 

Thus, the current research concerning the biological role and function of EGF receptors 
has been contradictory, and does not provide a clear indication of how any particular EGF 
5 receptor can be used as a tool in diagnosing any particular cancer type. Therefore, a need exists 
for the isolation of soluble epidermal growth factor receptor protein molecules and their 
isoforms, and for the characterization of their correlated disease conditions. In addition, a useful, 
quantitative diagnostic method to detect the presence of biologically relevant isoforms for 
diagnosing the onset and progression of diseases associated with these soluble endothelial 
10 growth factor receptor protein molecules is also needed. 

SUMMARY OF THE INVENTION 
In a first aspect, the invention provides several novel isolated and purified non-genomic 
nucleic acids which encode soluble isoforms of the human EGFR ErbBl, as well as nucleic acids 

1 5 encoding engineered variants of these proteins. Preferred embodiments of this aspect of the 

invention are nucleic acid sequences which specifically encode a soluble form of a human EGFR 
("sEGFR") whose amino acid sequence comprises the sequence of SEQ ID NO:l. The nucleic 
acid embodiments of the invention include, e.g., DNA SEQ ID NO:2, which is the naturally 
occurring sequence encoding the polypeptide SEQ ID NO:l. The nucleic acids of the invention 

20 also include nucleic acid sequences which are complementary to or synonymous with SEQ ID 
NO. 2, (i.e., also encode DNA SEQ ID NO. 1.) Other preferred embodiments of this aspect of 
the invention include nucleic acids which encode proteins which comprise a sequence which is 
has at least 90% identity with SEQ ID NO. 1 , more preferably at least 95% identity with SEQ ID 
NO. 1, more preferably at least 98% identity with SEQ ID NO. 1, and most preferably at least 

25 99% identity with SEQ ID NO. 1 . A preferred embodiment of these nucleic acids would be a 
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nucleic acid encoding the naturally occurring truncated variant SEQ ID NO. 3, which is 
approximately 90.9 % identical to SEQ ID NO. 1 (641 out of 705 amino acids). Other preferred 
embodiments of these nucleic acids include nucleic acids encoding the point-mutation proteins 
SEQ ID NO. 4, SEQ ID NO. 5, and SEQ ID NO. 6, which are approximately 99.8% identical to 
5 SEQ ID NO. 1 (704 out of 705 amino acids). Other embodiments of the nucleic acid sequence 
invention include nucleic acids which are complementary to the above nucleic acids. 

The invention also provides an expression cassette comprising a nucleic acid of the 
invention encoding a sEGFR, which is operably linked to a promoter functional in a host cell. 
Another aspect of the present invention is a method of producing sEGFR polypeptides by 

10 incorporating such cassettes into expression vectors used to transform prokaryotic or eukaryotic 
host cells to express sEGFR polypeptide. The sEGFR polypeptide may then be isolated from the 
host cell or the culture media by methods well known in the protein purification arts. The 
vectors of the invention may also contain a functional DNA sequence that is comprised of a 
selectable marker gene and/or reporter gene, as described below, or an additional nucleic acid 

15 sequence encoding a polypeptide tag for purification. Another aspect of the present invention is 
an isolated and purified sEGFR, produced as above. A preferred sEGFR comprises a 
polypeptide having SEQ ID NO:l. sEGFR can be employed in cell growth assays, ligand 
binding assays, and biomarker detection assays such as those described in Examples below. 
Yet another aspect of the present invention are polypeptides useful for generating 

20 antibodies specific to the proteins encoded by the nucleic acid sequences of the invention, 
including SEQ ID NO. 1, SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID NO. 5, and SEQ ID NO. 6. 
The polypeptides of the invention comprise an amino acid sequence of 10 to 25 amino acids in 
length, more preferably 1 1 to 21 amino acids in length, and most preferably 14 to 20 amino acids 
in length, wherein the amino acid sequence is identical to an amino acid sequence of similar 

25 length in an amino acid sequence selected from the group consisting of: amino acids 628-705 of 
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SEQ ID NO. 1, amino acids 628-705 of SEQ ID NO. 4, amino acids 628-705 of SEQ ID NO. 5, 
and amino acids 628-705 of SEQ ID NO. 6. Another aspect of the invention is an immunogenic 
conjugate comprising one of the above polypeptides and a carrier molecule. Preferred carrier 
molecules for use in immunogenic conjugates include keyhole limpet hemocyanin (KLH) and 
5 bovine serum albumin (BSA). Yet another aspect of the present invention are monoclonal or 
polyclonal antibodies produced using the above polypeptides which is specific for a protein 
selected from the group consisting of: SEQ ID NO. 1, SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID 
NO. 5, and SEQ ID NO. 6; and which do not cross-react with other ErbBl isoforms. 
Another aspect of the present invention is an expression cassette comprising: a 

10 preselected DNA segment that is complementary to SEQ ED NO:2 that is operably linked to a 
promoter functional in a host cell. Thus, the present invention provides an expression cassette 
that expresses an "antisense" mRNA transcript of a DNA sequence of the invention. Another 
aspect of the invention is a method of using this transcript by transforming a host cell with an 
expression cassette comprising the complementary sequence which is expressed within the host 

15 cell, and thus altering EGFR expression, cell growth and/or differentiation of the host cell. In 
addition, such complementary transcripts may be utilized in RNAse protection assays to 
determine the level of cellular expression of mRNAs encoding SEQ ID NO. 1. 

Yet another aspect of the invention is a sandwich immunoassay method for detecting or 
determining the concentration of soluble human epidermal growth factor receptor in a biological 

20 sample obtained from a patient. The method comprises: a) contacting an amount of a first 
purified antibody that specifically reacts with a first epitope of the extracellular ligand binding 
domain of sErbBl with the patient biological sample to be tested, wherein the first purified 
antibody is modified with a first labeling moiety, b) contacting the sample with an amount of a - 
second purified antibody that specifically reacts with a second epitope of the extracellular ligand 

25 binding domain of sErbB 1 , wherein the second purified antibody is modified with a second 
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labeling moiety, and wherein the second purified antibody does not competitively inhibit the 
binding of the first purified antibody, and c) determining presence or amount of the soluble 
epidermal growth factor receptor complexed with said antibodies by detecting the co-presence of 
the first and second labels. In preferred embodiments of this assay, the first antibody is either 
5 MAb R. 1 or an antibody which binds to the same epitope as MAb R. 1 (i.e. competitively inhibits 
the binding of MAb R.l to the ligand binding domain of EGFR). In further preferred 
embodiments, the second antibody is MAb 528, or an antibody which binds to the same epitope 
as MAb 528 (i.e. competitively inhibits the binding of MAb 528 to the ligand binding domain of 
EGFR). In especially preferred embodiments, either the first or second labeling moiety is 

1 0 acridinium. In preferred embodiments of this aspect of the invention the patient biological 
sample is blood, serum, plasma, urine, or ascites fluid. 

The invention further provides a diagnostic method for determining the presence, or risk, 
of an ovarian carcinoma in a female human patient. The method comprises a) determining the 
concentration of soluble EGFR in a biological sample obtained from a female patient (e.g., by the 

15 above immunochemical method,) b) comparing the concentration obtained in a) with a normal or 
baseline level for soluble EGFR that is preferably established with samples from female humans 
without ovarian cancer, and c) correlating a decrease in the concentration of sEGFR in the 
patient's sample with the presence of an ovarian carcinoma in the patient. In further 
embodiments of the aspect of the invention, a female patient may be monitored with repeated 

20 testing to determine the onset or progression of ovarian cancer. In further embodiments, the 
female patient may be tested before and after radiation, chemotherapy, or surgical treatment to 
monitor the regression or progression of ovarian cancer. 

Yet another aspect of the invention is a method to increase the half-life of ErbBl ligands 
in the circulatory system of a human patient. In particular, the method may be uses to increase 

25 the circulatory half-life of EGF and TGF-a by binding to the ligands in the patient's blood, 
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thereby inhibiting ligand degradation and extending the half-life of these ligands in the patient. 
The method comprises administering to a human patient a ligand-half-life-increasing amount of a 
sErbBl protein with an amino acid sequence chosen from the group consisting of SEQ ID NO. 1, 
SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID NO. 5, and SEQ ID NO. 6. 
5 Yet another embodiment of the invention is a method to modulate the function of ErbBl 

transmembrane receptors. The sErbBl molecules encoded by the nucleic acids of the invention 
competitively bind to ligands such as EGF and TGF-a to increase or decrease relative available 
amounts of a preselected ligand in humans. In addition, the sErbBl molecules may inhibit 
ErbBl receptor tyrosine kinases through the formation of inactive heterodimers. These inactive 

10 heterodimers compete for functional receptors thereby inhibiting a phosphorylation cascade and 
inactivating signal transduction from the extracellular to the intercellular domain. The method 
comprises administering to a human patient an ErbBl -function-inhibiting amount of a sErbBl 
protein with an amino acid sequence chosen from the group consisting of SEQ ID NO. 1, SEQ 
ID NO. 3, SEQ ID NO. 4, SEQ ID NO. 5, and SEQ ID NO. 6. 

1 5 Yet another embodiment of the invention is a method to regulate cellular proliferation 

and cellular differentiation. The sErbBl molecules inhibit cytokines and receptors necessary for 
normal cell proliferation and differentiation. The method comprises administering to a cell a 
cytokine-function-inhibiting amount of a sErbBl protein with an amino acid sequence chosen 
from the group consisting of SEQ ID NO. 1, SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID NO. 5, and 

20 SEQ ID NO. 6. 

Yet another embodiment of the invention is a diagnostic kit package for detecting or 
determining sEGFR in a human biological sample. The kit comprises: (a) a solid phase capable 
of having attached thereto a first antibody; (b) a first antibody that binds to a first epitope on the 
extracellular ligand binding domain of the human epidermal growth factor receptor, wherein the 
25 first antibody is modified with an attachment label moiety; (c) a second antibody which 



-8- 



Patent 
257/225 

specifically binds to a second epitope on the extracellular ligand binding domain of the human 
epidermal growth factor receptor, wherein the second antibody is labeled with a second label 
moiety; and (d) instructions for carrying out the immunoassay of the invention. Preferred kits 
comprise antibodies specific to a protein having the amino acid sequence SEQ ID NO. 1, SEQ ID 
5 NO. 3, SEQ ID NO. 4, SEQ ID NO. 5, or SEQ ID NO.6. 

BRIEF DESCRIPTION OF THE FIGURES AND DEFINITIONS 
Figure 1 . Schematic representation of full length, p60, and normal pi 10 EGFR 
transcripts. Open boxes represent exons. Lines represent introns. 
10 Figure 2. Structural comparison of p60 and pi 10 soluble forms of EGFR relative to the 

full-length EGFR. 

Figure 3. Schematic representation of alternative exons located in human EGFR introns 15 
and 16. 

Figure 4. DNA and translated protein sequences. 

15 Figure 5. Diagram showing the effects of pi 10 sErbBl truncated protein in vitro. 

Chinese hamster ovary (CHO) cells were transfected with either the full-length human EGFR 
cDNA (encoding pi 70) in the expression vector pcDNA3 (Invitrogen) or with the vector alone. 
Stable clonal isolates were selected with G418 and these cells were then transiently transfected 
with the alternative 3.0 kb EGFR cDNA (encoding pi 10) or with vector alone. When pi 70 and 

20 pi 10 were expressed in the same cells, significant cell death was observed 24-48 hours following 
transfection; however, no cell death was observed when either of these proteins were expressed 
individually. 

Figure 6. This diagram illustrates the structure of several EGFR isoforms, including the 
full length EGF receptor, pi 70 ErbBl (A), the mutant pi 10 soluble ErbBl analog of A431 cells 
25 [which differs in sequence from normal pi 10 from human placenta, as described below] (B), the 
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p60 soluble ErbBl analog of human placenta (C), and a recombinant pi 00 soluble ErbBl analog 
(D). Additional information about these molecules is given in the chart. The full-length 170 kD 
ErbBl receptor contains an extracellular domain with four distinct subdomains (I - IV), a 
transmembrane domain, and a cytoplasmic domain. The mutant A431 pi 10 sErbBl analog 
5 contains extracellular subdomains I through IV and 17 unique amino acids at its carboxy- 

terminal end. The p60 sErbBl analog contains subdomains I and II, a portion of subdomain III, 
and 2 unique carboxy-terminal amino acids. Recombinant human pi 00 sErbBl ends at amino 
acid 589 (nucleotide numbering according to Ullrich et al., 1984) and, therefore, embodies 
subdomains I through IV without any additional unique carboxy-terminal amino acids. The 

10 mutant pi 10 and 'natural' p60 human sErbBl analogs are synthesized from truncated, 

alternatively spliced.2.8 kb and 1.8 kb mRNA transcripts of A431 carcinoma cells and normal 
placenta, respectively. Qt6 cells transfected with the plasmid vector, psErbBlECD589, 
synthesizes the recombinant human pi 00 sErbBl analog. The pi 10 sEGFR encoded by the 
nucleic acids of the invention, not shown, contains subdomains I - IV, as well as its 78 unique 

15 amino acid carboxy end sequence. 

Figure 7. Dose-response curves of the ALISA toward whole cell lysates (A) and 
conditioned culture media (B) from various cell lines that synthesize different combinations of 
ErbB-related molecules are shown (bottom axes). The standard dose-response curve with pi 70 
ErbBl also is shown on each graph (top axes). Qt6 quail fibroblasts do not synthesize any 

20 human ErbB molecules. The human breast carcinoma cell line, MDA-MB-453, is known to 
synthesize full-length ErbB2, ErbB3, and ErbB4, but not ErbBl. The human breast carcinoma 
cell line, SK-BR-3, is known to synthesize complete ErbBl, ErbB2, ErbB3, and ErbB4 receptors. 
The dose-response curves with A431 and SK-BR-3 whole cell lysates are positive, and those 
with MDA-MB-453 and Qt6 whole cell lysates are negative (A). A431 cells secrete a mutant 

25 pi 10 sErbBl analog; whereas Qt6, Qt6/pDR161, and Qt6/psErbBlECD589 cells secrete no 
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ErbBl -related molecules, p60 sErbBl, and plOO sErbBl, respectively. MDA-MB-453 and SK- 
BR-3 cells are not known to secrete sErbBl molecules. However, SK-BR-3 cells have been 
shown to secrete a sErbB2 analog of approximately 105 kD. The dose-response curves with 
A431 and Qt6/psErbBlECD589 conditioned media are positive, and those with Qt6/pDR161 and 
5 SK-BR-3 conditioned media are negative (13). Thus, as is shown by these graphs, the ALISA 
described in Example V detects ErbBl isoforms which contain subdomains I-IV (A431 pi 10, 
pi 00, pi 70, and by implication placental pi 10), but does not detect isoforms which contain only 
sub domains I-IIL 

Figure 8. The concentration of sErbBl in normal human female (n^40) and male (n^40) 
10 sera as measured by the ALISA of Example V are compared as described in Example VI. 

Figure 9. The serum sErbBl concentrations for healthy females (A) and healthy males 
(B) are plotted as a function of patient age. 

Figure 10. The serum sErbBl concentrations of the healthy females are plotted as a 
function of pre- or post-menopause status. 
15 Figure 11. The serum sErbBl concentrations of pre-menopausal females are plotted with 

an age-matched group of males. 

Figure 12. The serum sErbBl concentrations, measured using the ALISA of Example V, 
of healthy women in a control group, with pre-operative endothelial ovarian cancer (EOC), with 
benign ovarian tumors, and with benign gynecological diseases are shown. The median sErbBl 
20 concentration of women with EOC is significantly less than that of women without ovarian 
cancers. 

Figure 13. The serum sErbBl concentrations, measured using the ALISA of Example V, 
of healthy women in a control group age-matched to the pre-operative EOC group, EOC patients 
who gave pre-operative samples, a control group age-matched to the post-operative EOC group, 
25 a post-operative EOC group 0-34 days after surgery, and a post-operative EOC group 35-287 
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days after surgery. 

Figure 14. Normal human male and female sera were immunoprecipitated with Protein-G 
MAb minus, R.1-, 225-, or 528-coupled resins and Western blotted with a mixture (15E1 1, 2D2, 
LA22, and CI 1) of anti-ErbBl ECD-specific MAbs. A reactive band of approximately 1 10 kD 
5 eluted from each MAb affinity resin, but not from the MAb minus resin used to 
immunoprecipitate normal male or female sera. 

Figure 15. This figure shows the dose-response curves of the current Oncogene Research 
Products ELISA with the pi 70 ErbBl standard supplied in the Oncogene Research Products kit 

0 and with the pi 70 ErbBl standard, purchased from Sigma (A). Serial dilutions of both pi 70 

CP 10 ErbBl standards and the zero calibrator, trpEcentrin, were prepared and assayed in both standard 

01 and sample diluents. Both standards show stronger dose-response curves in the sample diluent 
Hj than the standard diluent. In addition, the pi 70 ErbBl standard supplied by Oncogene Research 
g Products gives stronger response curves than the pi 70 ErbBl standard purchased from Sigma 

Chemical. Serum sErbBl levels measured with the Oncogene Research Products ELISA for 
!S 15 forty healthy human subjects (20 women and 20 men) are compared to the sErbBl levels 
J? measured with our ErbB 1 ECD-specific ALISA (B). The sErbBl levels determined with the 

Oncogene Research Products ELISA for men are not significantly different from those of 
women. No association between the sErbBl values obtained with the Oncogene Research 
Products ELISA and those obtained with the ErbBl ECD-specific ALISA is observed. 

20 

DEFINITIONS 

As used herein, the term "soluble" epidermal growth factor receptor (sEGFR) means that 
the epidermal growth factor receptor polypeptide is found in a form that is not anchored to the 
membrane of a cell, i.e., a portion of the sEGFR is not found physically embedded in the lipid 
25 bilayer which comprises the cell membrane in the organism of its origin. Preferred soluble forms 
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of the receptor are secreted by the cell or proteolytically cleaved from the cell surface. Other 
preferred soluble forms of the receptor may comprise the extracellular ligand binding domain of 
the EGFR, and lack at least a portion of the transmembrane domain (TM), i.e., the membrane- 
anchoring domain of the EGFR, so as to result in an EGFR which is not firmly anchored to, 
5 attached to, or embedded in the cell membrane. 

As used herein, the terms "isolated and/or purified" refer to in vitro isolation of a nucleic 
acid or polypeptide molecule from its natural cellular environment, and from association with 
other naturally occurring components of the cell. Such molecules may then be sequenced, 
replicated, manipulated, and/or recombined for artificial in vivo or in vitro expression. Thus, the 

10 RNA or DNA is "isolated" in that it is free from at least one contaminating nucleic acid with 
which it is normally associated in the natural source of the RNA or DNA and is preferably 
substantially free of any other mammalian RNA or DNA. The phrase "free from at least one 
contaminating source nucleic, acid with which it is normally associated" includes the case where 
the nucleic acid is reintroduced into the source or natural cell but is in a different chromosomal 

15 location or is otherwise flanked by nucleic acid sequences not normally found in the source cell. 

As used herein, the term "biologically activity" of a peptide of the invention is defined to 
mean a polypeptide comprising a subunit of a peptide having SEQ ED NO:l, or a variant thereof, 
which has at least about 10%, preferably at least about 50%, and more preferably at least about 
90%o, of the activity of a peptide having SEQ ID NO: 1 . The activity of a peptide of the invention 

20 can be measured by methods well known in the art including, but not limited to, the ability to 
bind EGF, or the ability of the peptide to elicit a sequence-specific immune response when the 
peptide is administered to an organism, e.g., goat, sheep or mice. 

As used herein with regard to polypeptide sequences, the term "%> identity" means the 
percentage of amino acids in a compared sequence which are identical with the amino acids in a 

25 reference sequence, when both sequences are aligned and gaps are introduced, when necessary, 
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to produce the best match. As used herein, sequences with "substantial identity" are at least 90% 
identical. It is also preferred that the non-identical amino acids in the compared sequence be 
conservatively substituted with like amino acids in substantially identical polypeptides. 

As used herein, the term "complementary," when used to describe nucleic acids, refers to 
5 the ability of the nucleic acids to hybridize with each other. Preferred complementary nucleic 
acids have an exact complementarity with regard to A/T and G/C matching. However, 
substantially complementary nucleic acids which hybridize to RNA or DNA and remain stably 
bound under stringent conditions, as defined by methods well known in the art (Sambrook et aL, 
supra), would be sufficiently complementary for some uses (e.g., as anti-sense nucleotide 
10 sequences.) 

As used herein, the term "synonymous," when used to describe nucleic acids, refers to the 
polypeptide sequence encoded by a compared nucleic acid relative to a reference nucleic acid. 
The genetic code is well known by those of skill in the art, and sequences with codon 
substitutions which encode the same amino acid can be easily devised for various purposes ( e.g., 

15 introducing convenient restriction enzyme cleavage sites or optimizing codon usage for a 

particular recombinant protein production organism) without changing the translated polypeptide 
sequence. Utilizing common tools such as phosphoramidite polynucleotide synthesis and site 
directed mutagenesis or other recombinant techniques, such substitutions may easily be effected 
by those of ordinary skill in the art. Thus, sequences which are synonymous with the exemplary 

20 nucleotide sequences are also considered to be within the scope of the present invention. 

However, sequences which do not directly encode the translation of the same protein would not 
be considered to be "synonymous." Specifically, the full genomic ErbBl gene is not considered 
to be synonymous with the nucleic acids of the invention, as the full length gene can encode 
many alternatively spliced transcripts, including the nucleic acids of the invention, but does not 

25 directly encode proteins such as SEQ ID NO. 1. 
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The terms "recombinant nucleic acid" or "preselected nucleic acid," e.g., "recombinant 
DNA sequence or segment" or "preselected DNA sequence or segment" refer to a nucleic acid 
that has been derived or isolated from any appropriate tissue source and that may be 
subsequently chemically altered, typically in vitro, so that its sequence is not naturally occurring, 
5 or corresponds to naturally occurring sequences that are not positioned as they would be 

positioned in a genome which has not been transformed with exogenous DNA. An example of 
preselected DNA "derived" from a source, would be a DNA sequence that is identified as a 
useful fragment within a given organism, and which is then chemically synthesized in essentially 
pure form. An example of such DNA "isolated" from a source would be a useful DNA sequence 

1 0 that is excised or removed from said source by chemical means, e.g. , by the use of restriction 
endonucleases, so that it can be further manipulated, e.g., amplified, for use in the invention, by 
the methodology of genetic engineering. 

"Regulatory sequences" is defined to mean RNA or DNA sequences necessary for the 
expression, post-transcriptional modification, translation, and post-translational modification of 

15 an operably linked coding sequence in a particular host organism. The control sequences that are 
suitable for prokaryotic cells, for example, include a promoter, and optionally an operator 
sequence, and a ribosome binding site. Eukaryotic cells are known to utilize promoters, stop 
sequences, enhancers, splicing, and polyadenylation signal sequences, as well as glycosylation 
and secretory signal sequences. 

20 "Operably linked" is defined to mean that nucleic acids are placed in a functional 

relationship with one another in a nucleic acid sequence. For example, DNA for a secretory 
leader is operably linked to DNA for a polypeptide if it is expressed as a pro-polypeptide that 
participates in the secretion of the polypeptide; a promoter or enhancer is operably linked to a 
coding sequence if it affects the transcription of the sequence; or a ribosome binding site is 

25 operably linked to a coding sequence if it is positioned to facilitate translation. Generally, 
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"operably linked" means that the DNA sequences being linked are contiguous and, in the case of 
a secretory leader, contiguous and in reading phase. However enhancers do not have to be 
contiguous. Linking is accomplished by ligation at convenient restriction sites. If such sites do 
not exist, the synthetic oligonucleotide linkers or mutagenesis are used in accord with 
5 conventional practice. 

As used herein, the term "cell line" or "host cell" is intended to refer to well- 
characterized homogenous, biologically pure populations of cells. These cells may be eukaryotic 
cells that are neoplastic or which have been "immortalized" in vitro by methods known in the art, 
as well as primary cells, or prokaryotic cells. The cell line or host cell is preferably of 

1 0 mammalian origin, but cell lines or host cells of non-mammalian origin may be employed, 

including avian, plant, insect, yeast, fungal or bacterial sources. Generally, the preselected DNA 
sequence is related to a DNA sequence that is resident in the genome of the host cell but is not 
expressed, or not highly expressed, or, alternatively, over-expressed. 

The terms "transfected" or "transformed" are used herein to include any host cell or cell 

1 5 line, the genome of which has been altered or augmented by the presence of at least one 

preselected DNA sequence, which DNA is also referred to in the art of genetic engineering as 
"heterologous DNA," "recombinant DNA," "exogenous DNA," "genetically engineered DNA," 
"non-native DNA," or "foreign DNA," wherein said DNA was isolated and introduced into the 
genome of the host cell or cell line by the process of genetic engineering. The host cells of the 

20 present invention are typically produced by transfection with a DNA sequence in a plasmid 

expression vector, a viral expression vector, or as an isolated linear DNA sequence. Preferably, 
the transfected DNA is a chromosomally integrated recombinant DNA sequence, which 
comprises a gene encoding sEGFR, which host cell may or may not express significant levels of 
autologous or "native" sEGFR. 
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DETAILED DESCRIPTION OF THE INVENTION 
In order to more fully understand the mechanism by which the sEGFR influences cell 
growth and differentiation in normal tissue, cDNAs which encode the extracellular ligand 
binding domain, but not the cytoplasmic kinase domain, of the human epidermal growth factor 
5 were isolated from a human placental cDNA library. Besides identifying a soluble form of the 
human epidermal growth factor receptor which may play a role in, or be associated with, cell 
growth and differentiation in normal cells, the identification and isolation of cDNAs encoding 
novel sEGFR transcripts has proven useful for defining the molecular basis for some neoplastic 
processes. Applicants have utilized monoclonal antibodies against the extracellular domain of 

10 EGFR to produce a useful diagnostic test which demonstrates the correlation between the 

discovered forms of sEGFR and ovarian cancer. By further utilizing the nucleic acid sequences 
of the invention to recombinantly produce the encoded sEGFRs, or producing specific antibodies 
to the unique carboxy terminal sequence of these proteins, other important advances in 
therapeutics and diagnostics may be realized through the invention. 

1 5 Applicants have demonstrated a significant correlation between the levels of sEGFR and 

ovarian cancer. Thus, patient samples, e.g., tissue biopsies, sera or plasma, may now be analyzed 
with antibodies specific for the sEGFR to detect the presence and progression of ovarian 
carcinomas in patients. As demonstrated by the examples below, applicants have found the level 
of sEGFR in patient samples to be useful in determining residual disease, responsiveness to 

20 chemotherapy, or overall survival. In addition, the levels of particular isotypes of sEGFR in a 
patient may be a useful indicator of the histological subtype of a tumor. 

Furthermore, the cloning of transcripts encoding sEGFRs will elucidate the molecular 
mechanism giving rise to the presence or absence of sEGFR in patients with disease. Once the 
molecular mechanism underlying the expression of sEGFR is understood, molecular genetic 

25 based therapies directed to controlling the expression of sEGFR can then be employed to correct, 
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inhibit or supplement the expression of sEGFR or full-length EGFR in patients with the disease. 
For example, an expression vector containing DNA encoding antisense sEGFR transcripts can be 
introduced into tumor cells to inhibit or reduce the overexpression of full-length EGFR. 

The cDNAs encoding sEGFR can also be employed in expression cassettes to synthesize 
5 sEGFR in vitro. In vitro prepared sEGFR can be employed to obtain antibodies specific for 
soluble forms of the EGFR. In vitro synthesized sEGFR may also be employed in a 
pharmaceutical formulation which, when administered to a human, can form inactive 
heterodimers with full-length EGFR to block EGFR activation, compete with the full-length 
EGFR for ligand, or block EGFR signal transduction post-ligand binding, and thus suppress or 

1 0 decrease the growth stimulatory activity of the EGFR. 

The cDNAs of the present invention are useful for detecting the expression of sEGFR, for 
detecting related DNA molecules and for amplifying nucleic acid sequences, wherein said 
sequences fall within the scope of the present invention. The antibodies of the invention, besides 
being useful to detect sEGFR levels in patient samples, maybe useful in vivo to inhibit the natural 

15 functions of sEGFR in various disease states. 

Sources and Isolation of Nucleic Acids Encoding sEGFR 

Sources of nucleotide sequences from which the present DNA molecules encoding 
human sEGFR can be derived include total or polyA + RNA from any human cellular source, 
20 preferably from embryonic cells such as those from placental tissue, carcinomas, or cell lines 
derived therefrom, from which cDNAs encoding sEGFR can be derived by methods known in 
the art and described below in Example I. Other sources of the DNA molecules of the invention 
include cDNA libraries derived from any human cellular source including placental cDNA 
libraries. 

25 A nucleic acid molecule encoding sEGFR can be identified and isolated using standard 
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methods, as described by Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor, NY (1989). For example, reverse transcriptase PCR (RT-PCR) can be employed 
to isolate and clone sEGFR cDNAs. 

Oligo-dT can be employed as a primer in a reverse transcriptase reaction to prepare first- 
5 strand cDNAs from isolated RNA that contains RNA sequences of interest, e.g., total RNA 
isolated from human placental tissue. RNA can be isolated by methods known to the art, e.g., 
using TRIZOL™ reagent (GIBCO-BRL/Life Technologies, Gaithersburg, MD). Resultant first- 
strand cDNAs are then amplified in PCR reactions. 

Primers are made to correspond to highly conserved regions of polypeptides or nucleotide 
10 sequences which were identified and compared to generate the primers, e.g., by a sequence 

comparison of other mammalian or avian EGFRs, particularly sEGFRs. One primer is prepared 
which is predicted to anneal to the antisense strand, and another primer prepared which is 
predicted to anneal to the sense strand, of a DNA molecule that encodes a soluble form of the 
human EGFR. 

1 5 The products of each PCR reaction are separated by an agarose gel and all consistently 

amplified products are gel purified and cloned directly into a suitable vector, such as a known 
plasmid vector. The resultant plasmids are subjected to restriction endonuclease and dideoxy 
sequencing of double-stranded plasmid DNAs. 

Another approach to identify, isolate and clone cDNAs that encode sEGFR is to screen a 

20 cDNA library generated from embryonic tissue. Screening for DNA fragments that encode all or 
a portion of a cDNA encoding sEGFR can be accomplished by probing the library with a probe, 
which has sequences that are highly conserved between genes believed to be related to sEGFR, 
e.g., DNA encoding rat or avian sEGFR or encoding sEGFR from A431 cells, or by screening of 
plaques for binding to antibodies that specifically recognize sEGFR. DNA fragments that bind 

25 to a probe having sequences which are related to sEGFR, or which are immunoreactive with 
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antibodies to sEGFR can be subcloned into a suitable vector and sequenced and/or used as 
probes to identify other cDNAs encoding all or a portion of sEGFR. 

Thus, recovery or isolation of a given fragment of DNA from a restriction digest can 
employ separation of the digest on polyacrylamide or agarose gel by electrophoresis, 
5 identification of the fragment of interest by comparison of its mobility versus that of marker 
DNA fragments of known molecular weight, removal of the gel section containing the desired 
fragment, and separation of the gel from DNA (Lawn et ah, Nucleic Acids Res., 9, 6103 (1981); 
Goeddel et ah, Nucleic Acids Res., 8, 4057 (1980)). 

10 Variants of the DNA Molecules of the Invention 

Nucleic acid molecules encoding amino acid sequence variants of sEGFR are prepared by 
a variety of methods known in the art. These methods include, but are not limited to, isolation 
from a natural source (in the case of naturally occurring amino acid sequence variants such as 
SEQ ID NO. 3-6) or preparation by oligonucleotide-mediated (or site-directed) mutagenesis, 

15 PCR mutagenesis, and cassette mutagenesis of an earlier prepared variant or a non-variant 
version of sEGFR polypeptide. 

Oligonucleotide mediated mutagenesis is a preferred method for preparing amino acid 
substitution variants of sEGFR. This technique is well known in the art (Adelman et ah, DNA, 
2, 183 (1983)). Briefly, sEGFR DNA is altered by hybridizing an oligonucleotide encoding the 

20 desired mutation to a DNA template, where the template is the single-stranded form of a plasmid 
or bacteriophage containing the unaltered or native DNA sequence of sEGFR. After 
hybridization, a DNA polymerase is used to synthesize an entire second complementary strand of 
the template that will thus incorporate the oligonucleotide primer, and will code for the selected 
alteration in the sEGFR DNA. 

25 Generally, oligonucleotides of at least 25 nucleotides in length are used. An optimal 
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oligonucleotide will have 1 2 to 1 5 nucleotides that are completely complementary to the 
template on either side of the nucleotide(s) coding for the mutation. This ensures that the 
oligonucleotide will hybridize properly to the single-stranded DNA template molecule. The 
oligonucleotides are readily synthesized using techniques known in the art (Crea et al, Proc. 
5 Nat'l. Acad. ScL, 75, 5765 (1978)). 

The DNA template can be generated by those vectors that are either derived from 
bacteriophage M13 vectors (the commercially available M13mpl8 and M13mpl9 vectors are 
suitable), or those vectors that contain a single-stranded phage origin of replication as described 
by Viera et al, Meth. Enzymol., 153, 3 (1987). Thus, the DNA that is to be mutated may be 
10 inserted into one of these vectors to generate single-stranded template. Production of the single- 
stranded template is described in Sections 4.21-4.41 of Sambrook et al, Molecular Cloning 
Laboratory Manual (Cold Spring Harbor Laboratory Press, N.Y. (1989)). Alternatively, single- 
stranded DNA template may be generated by denaturing double-stranded plasmid (or other) 
DNA using standard technique 
1 5 For alteration of the native DNA sequence (to generate amino acid sequence variants, for 

example), the oligonucleotide is hybridized to the single-stranded template under suitable 
hybridization conditions. A DNA polymerizing enzyme, usually the Klenow fragment of DNA 
polymerase I, is then added to synthesize the complementary strand of the template using the 
oligonucleotide as a primer for synthesis. A heteroduplex molecule is thus formed such that one 
20 strand of DNA encodes the mutated form of the sEGFR, and the other strand (the original 

template) encodes the native, unaltered sequence of the sEGFR. This heteroduplex molecule is 
then transformed into a suitable host cell, usually a prokaryote such as E. coll JM101. After the 
cells are grown, they are plated onto agarose plates and screened using the oligonucleotide 
primer radiolabeled with 32-phosphate to identify the bacterial colonies that contain the mutated 
25 DNA. The mutated region is then removed and placed in an appropriate vector for polypeptide 
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production, generally an expression vector of the type typically employed for transformation of 
an appropriate host. 

The method described immediately above may be modified such that a homoduplex 
molecule is created wherein both strands of the plasmid contain the mutations(s). The 
5 modifications are as follows: The single-stranded oligonucleotide is annealed to the single- 
stranded template as described above. A mixture of three deoxyribonucleotidetriphosphates, 
deoxyriboadenosine (dATP), deoxyriboguanosine (dGTP), and deoxyribothymidine (dTTP), is 
combined with a modified thiodeoxyribocytosine called dCTP-(aS) (Amersham Corporation). 
This mixture is added to the template-oligonucleotide complex. Upon addition of DNA 
1 0 polymerase to this mixture, a strand of DNA identical to the template except for the mutated 

bases is generated. In addition, this new strand of DNA will contain dCTP-(aS) instead of dCTP, 
which serves to protect it from restriction endonuclease digestion. 

After the template strand of the double-stranded heteroduplex is nicked with an 
appropriate restriction enzyme, the template strand can be digested with ExoIII 
1 5 nuclease or another appropriate nuclease past the region that contains the site(s) to be 

mutagenized. The reaction is then stopped to leave a molecule that is only partially single- 
stranded. A complete double-stranded DNA homoduplex is then formed using DNA polymerase 
in the presence of all four deoxyribonucleotide triphosphates, ATP, and DNA ligase. This 
homoduplex molecule can then be transformed into a suitable host cell such as E. coli JM101. 
20 A preferred embodiment of the invention is an isolated and purified DNA molecule 

comprising a preselected DNA sequence encoding a sEGFR polypeptide comprising SEQ ID 
NO:2 having nucleotide substitutions which are "silent." That is, when nucleotide substitutions 
are present in a codon, the same amino acid is encoded by the codon with the nucleotide 
substitution as is encoded by the codon without the substitution. For example, leucine is 
25 encoded by the codon CTT, CTC, CTA and CTG. A variant of SEQ ID NO:2 at the first codon 
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in the mature polypeptide (CTG in SEQ ID NO:2) includes the substitution of CTT, CTC or 
CTA for CTG. Nucleotide substitutions can be introduced into DNA segments by methods well 
known to the art (Sambrook et ah, supra). 

5 Chimeric Expression Cassettes 

The recombinant or preselected DNA sequence or segment used to prepare expression 
cassettes for transformation, may be circular or linear, double-stranded or single-stranded. 
Generally, the preselected DNA sequence or segment is in the form of chimeric DNA, such as 
plasmid DNA that can also contain coding regions flanked by control sequences that promote the 

10 expression of the preselected DNA present in the resultant cell line. Aside from preselected 
DNA sequences that serve as transcription units for sEGFR or portions thereof, a portion of the 
preselected DNA may be untranscribed, serving a regulatory or a structural function. For 
example, the preselected DNA may itself comprise a promoter that is active in mammalian cells, 
or may utilize a promoter already present in the genome that is the transformation target. Such 

15 promoters include the CMV promoter, as well as the SV40 late promoter and retroviral LTRs 
(long terminal repeat elements), although many other promoter elements well known to the art 
may be employed in the practice of the invention. A preferred promoter useful in the practice of 
the invention is the CMV promoter. Another preferred promoter useful in the practice of the 
invention is the Rous Sarcoma Virus LTR promoter. 

20 Other elements functional in the host cells, such as introns, enhancers, polyadenylation 

sequences and the like, may also be a part of the preselected DNA. Such elements may or may 
not be necessary for the function of the DNA, but may provide improved expression of the DNA 
by affecting transcription, stability of the mRNA, or the like. Such elements may be included in 
the DNA as desired to obtain the optimal performance of the transforming DNA in the cell. 

25 The preselected DNA to be introduced into the cells further will generally contain either a 
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selectable marker gene or a reporter gene or both to facilitate identification and selection of 
transformed cells from the population of cells sought to be transformed. Alternatively, the 
selectable marker may be carried on a separate piece of DNA and used in a co-transformation 
procedure. Both selectable markers and reporter genes may be flanked with appropriate 
regulatory sequences to enable expression in the host cells. Useful selectable markers are well 
known in the art and include, for example, antibiotic and herbicide-resistance genes, such as neo, 
hpt, dhfr, bar, aroA, dapA and the like (Lundquist et ah (U.S. Patent No. 5,848,956)). 

Reporter genes are used for identifying potentially transformed cells and for evaluating 
the functionality of regulatory sequences. Reporter genes that encode for easily assayable 
polypeptides are well known in the art. In general, a reporter gene is a gene which is not present 
in or expressed by the recipient organism or tissue and which encodes a polypeptide whose 
expression is manifested by some easily detectable property, e.g., enzymatic activity. Preferred 
genes include the chloramphenicol acetyl transferase gene (cat) from Tn9 of E. coll, the beta- 
glucuronidase gene (gus) of the uidA locus of E. coll, and the luciferase gene from firefly 
Photinus pyralis. Expression of the reporter gene is assayed at a suitable time after the DNA has 
been introduced into the recipient cells. 

The general methods for constructing recombinant DNA which can transform target cells 
are well known to those skilled in the art, and the same compositions and methods of 
construction may be utilized to produce the DNA useful herein (Sambrook et ah, supra). 

Transformation into Host Cells 

The recombinant DNA can be readily introduced into host cells, e.g., mammalian, 
bacterial, yeast or insect cells by transfection with an expression vector comprising DNA 
encoding sEGFR by any procedure useful for the introduction into a particular cell, e.g., calcium 
phosphate precipitation, lipofection, electroporation, and the like, to yield a transformed cell 
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having the cDNA stably integrated into its genome, so that the DNA molecules, sequences, or 
segments, of the present invention are expressed by the host cell. That is, the present invention 
also provides a transformed host cell having a genome augmented by a recombinant (non-native) 
DNA sequence, preferably by a chromosomally integrated recombinant (genetically engineered) 
5 DNA sequence that includes a gene encoding a sEGFR. 



sEGFR Polypeptides 

The present invention provides an isolated, purified sEGFR polypeptide, which can be 

1 0 prepared by recombinant DNA methodologies. The general methods for isolating and purifying 
a recombinantly expressed protein from a host cell are well known to those in the art (Sambrook 
et aL, supra). Moreover, since the present invention provides the complete amino acid sequence 
of sEGFR (SEQ ID NO:l), sEGFR or bioactive variants thereof can also be synthesized by the 
solid phase peptide synthetic method (Stewart et al, Solid Phase Peptide Synthesis, W.H. 

15 Freeman Co., San Francisco (1969); Merrifield, J. Am. Chem. Soc, 85, 2149 (1963); 
Meienhofer, "Hormonal Proteins and Peptides," ed.). 

sEGFR polypeptide expressed in a recombinant cell is purified from recombinant cell 
proteins or polypeptides to obtain preparations that are substantially homogenous. For example, 
the culture medium or lysate can be centrifiiged to remove particulate cell debris. The membrane 

20 and soluble protein fractions are then separated. sEGFR polypeptide can then be purified from 
contaminant soluble or membrane proteins and polypeptides by fractionation on immunoaffinity 
or ion exchange columns; ethanol precipitation; reverse phase HPLC; chromatography on silica 
or on an anion exchange resin such as DEAE; chromatofocusing; SDS-PAGE; ammonium 
sulfate precipitation; gel filtration using, e.g., Sephadex G-75; or ligand affinity chromatography, 

25 and the like. An example of such purification is provided in Ilekis et aL, supra, as well as in 
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Example II herein, 

sEGFR polypeptide, variant sEGFR polypeptides or biologically active subunits thereof 
can also be prepared by in vitro transcription and translation reactions. A sEGFR expression 
cassette can be employed to generate sEGFR transcripts that are subsequently translated in vitro 
5 so as to result in a preparation of substantially homogenous sEGFR, variant sEGFR, or 
biologically active subunits thereof. The construction of vectors for use in vitro 
transcription/translation reactions, as well as the methodologies for such reactions, are well 
known to the art. 

Once isolated from the resulting transgenic host cells or from in vitro 
10 transcription/translation reactions, derivatives and chemically derived variants of the sEGFR 
polypeptide can be readily prepared. For example, amides of the sEGFR polypeptides of the 
present invention may also be prepared by techniques well known in the art for converting a 
carboxylic acid group or precursor, to an amide. A preferred method for amide formation at the 
C-terminal carboxyl group is to cleave the polypeptide from a solid support with an appropriate 
1 5 amine, or to cleave in the presence of an alcohol, yielding an ester, followed by aminolysis with 
the desired amine. 

Salts of carboxyl groups of the sEGFR polypeptide may be prepared in the usual manner 
by contacting the peptide with one or more equivalents of a desired base such as, for example, a 
metallic hydroxide base, e.g., sodium hydroxide; a metal carbonate or bicarbonate base such as, 
20 for example, sodium carbonate or sodium bicarbonate; or an amine base such as, for example, 
triethylamine, triethanolamine, and the like. 

N-acyl derivatives of an amino group of the present polypeptides may be prepared by 
utilizing an N-acyl protected amino acid for the final condensation, or by acylating a protected or 
unprotected peptide. O-acyl derivatives may be prepared, for example, by acylation of a free 
25 hydroxy peptide or peptide resin. Either acylation may be carried out using standard acylating 
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reagents such as acyl halides, anhydrides, acyl imidazoles, and the like. Both N- and O-acylation 
may be carried out together, if desired. In addition, the internal sEGFR amino acid sequence of 
SEQ ID NO:l can be modified by substituting one or two conservative amino acid substitutions 
for the positions specified, including substitutions that utilize the D rather than L form. 
5 The invention is also directed to variant or modified forms of the sEGFR polypeptide. 

One or more of the residues of this polypeptide can be altered, so long as the variant polypeptide 
has at least about 50%, preferably at least about 80%, and more preferably at least about 90%, of 
the biological activity of the polypeptide having SEQ ID NO:l . Conservative amino acid 
substitutions are preferred » that is, aspartic-glutamic as acidic amino acids; 
10 lysine/arginine/histidine as basic amino acids; leucine/isoleucine, methionine/valine, 

alanine/valine as hydrophobic amino acids; serine/glycine/alanine/threonine as hydrophilic 
amino acids. 

Acid addition salts of the polypeptides may be prepared by contacting the polypeptide 
with one or more equivalents of the desired inorganic or organic acid, such as, for example, 
15 hydrochloric acid. Esters of carboxyl groups of the polypeptides may also be prepared by any of 
the usual methods known in the art. 



sEGFR Variant Polypeptides 

20 It is envisioned that variant sEGFR polypeptides have at least one amino acid substitution 

relative to SEQ ID NO:l. In particular, amino acids are substituted in a relatively conservative 
manner. Such conservative substitutions are shown in Table I under the heading of exemplary 
substitutions. More preferred substitutions are under the heading of preferred substitutions. 
After the substitutions are introduced, the products are screened for biological activity. 

25 
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TABLE I 

Original Exemplary Preferred 

Residue Substitutions Substitutions 

Ala (A) val; leu; ile val 

Arg (R) lys; gin; asn lys 

Asn (N) gin; his; lys; arg gin 

Asp (D) Glu glu 

Cys (C) Ser ser 

Gin (Q) Asn asn 

Glu (E) Asp asp 

Gly (G) Pro pro 

His (H) asn; gin; lys; arg arg 

He (I) leu; val; met; ala; phe; leu 

norleucine 

Leu(L) norleucine; ile; val; met; ala; ile 

phe 

Lys (K) arg; gin; asn arg 

Met (M) leu; phe; ile leu 

Phe (F) leu; val; ile; ala leu 

Pro(P) Gly gly 

Ser(S) Thr thr 

Thr (T) Ser ser 

Trp (W) Tyr tyr 

Tyr (Y) trp; phe; thr; ser phe 

Val(V) ile; leu; met; phe; ala; leu 

norleucine 
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Amino acid substitutions falling within the scope of the invention, are, in general, 
accomplished by selecting substitutions that do not differ significantly in their effect on 
maintaining (a) the structure of the polypeptide backbone in the area of the substitution, for 
5 example, as a sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at 
the target site, or (c) the bulk of the side chain. Naturally occurring residues are divided into 
groups based on common side-chain properties: 



(1) 


hydrophobic: norleucine, met, ala, val, leu, ile; 


(2) 


neutral hydrophilic: cys, ser, thr; 


(3) 


acidic: asp, glu; 


(4) 


basic: asn, gin, his, lys, arg; 


(5) 


residues that influence chain orientation: gly, pro; and 


(6) 


aromatic; trp, tyr, phe. 



15 It is preferred that the non-identical amino acids of engineered versions of these 

embodiments be conservatively substituted relative to the amino acids of SEQ ID NO. 1. It is 
also preferred that the proteins of the invention have at least 10% of the biological activity of the 
polypeptide SEQ. ID NO. 1, more preferably at least 50% of the biological activity of the 
polypeptide SEQ. ID NO. 1, and most preferably at least 90% of the biological activity of the 

20 polypeptide SEQ. ID NO. 1. The activity of the sEGFR polypeptides of the invention can be 
measured by methods well known to the art including, but not limited to, ligand binding assays 
(Flickinger et al. 9 Mol. Cell. BioL, 12, 883 (1992)), the ability of the sEGFR to be bound by 
antibodies specific for the extracellular ligand binding domain of EGFR (see Example V, Maihle 
et aL, supra , and Ilekis et al 9 supra), the ability of the sEGFR to inhibit the kinase activity of 

25 the full-length EGFR, and growth inhibition assays (see Example III). Amino acid substitutions 
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are introduced into the DNA molecules of the invention by methods well known to the art. For 
example, see the description herein above for the introduction of silent mutations into the DNA 
molecules of the invention. 

5 Immunogenic Polypeptide and Conjugates 

Because the full amino acid sequence of the pi 10 sEGFR has been elucidated by the 
present invention, the invention also provides polypeptides useful for generating antibodies 
specific to the proteins encoded by the nucleic acid sequences of the invention, including SEQ ID 
NO. 1, SEQ ED NO. 3, SEQ ID NO. 4, SEQ ID NO. 5 5 and SEQ ID NO. 6. By utilizing a portion 

10 of the sEGFR sequence which contains some of the 78 novel amino acids of the carboxy 

terminus of the disclosed EGFR proteins, antibodies raised to these peptides may be specific for 
the placental pi 10 sEGFR, but non-reactive with full length EGFR or other sEGFR isoforms 
such as p60. Thus, polyclonal or monoclonal antibodies particularly useful for detecting pi 10 
sEGFR in patient samples may be made using the disclosed 78 amino acid sequence. 

15 Polypeptides for use as immunogens will typically be smaller than the full sEGFR 

protein, ranging from 10 to about 500 amino acids in length. Although they may contain other 
amino acids for, e.g., conjugation purposes, the immunogenic polypeptides of the invention 
comprise an amino acid sequence specific for sEGFR which is of 10 to 25 amino acids in length, 
more preferably 11 to 21 amino acids in length, and most preferably 14 to 20 amino acids in 

20 length. These lengths of specific sequence are typical of those used in the art for conjugation to 
immunogenic carrier molecules. The amino acid sequence is identical to an amino acid sequence 
of similar length in an amino acid sequence selected from the group consisting of: amino acids 
628-705 of SEQ ID NO. 1, amino acids 628-705 of SEQ ID NO. 4, amino acids 628-705 of SEQ 
ID NO. 5, and amino acids 628-705 of SEQ ED NO. 6. The polypeptides may optionally 

25 comprise further portions of the sEGFR amino acid sequence which is not specific for the pi 10 
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sEGFRs, e.g., exon 15 and 15 b of sEGFR. 

These polypeptides may be conveniently conjugated to immunostimulatory carrier 
molecules. Preferred carrier molecules for use in immunogenic conjugates include keyhole 
limpet hemocyanin (KLH), ovalbumin, and bovine serum albumin (BSA), however, other 
5 suitable carriers may be used. Conjugation chemistries for this purpose are well known in the 
art, and are available in prepackaged kits (e.g., from Sigma- Aldrich.) Once conjugated, the 
immunogenic peptide-carrier conjugates may then be used to immunize animals to produce 
monoclonal or polyclonal antibodies. Standard injection regimes, with our without the use of 
adjuvants (alum, Freunds's, etc.), may be used to produce the desired immune response. 
10 After harvesting, monoclonal or polyclonal antibodies produced using the above 

polypeptides which are specific for the pi 10 EFGRs, and which do not cross-react with other 
ErbBl isoforms, may be selected by screening for binding to recombinant pi 10 and pi 70 EGFR, 
as described in the Examples below. 

15 Immunoassays for sEGFRs, and their Diagnostic Uses 

An important aspect of the invention is a sandwich immunoassay method for detecting or 
determining the concentration of soluble human epidermal growth factor receptor in a biological 
sample obtained from a patient. The method comprises: a) contacting an amount of a first 
purified antibody that specifically reacts with a first epitope of the extracellular ligand binding 

20 domain of sErbBl with the patient biological sample to be tested, wherein the first purified 
antibody is modified with a first labeling moiety, b) contacting the sample with an amount of a 
second purified antibody that specifically reacts with a second epitope of the extracellular ligand 
binding domain of sErbBl, wherein the second purified antibody is modified with a second 
labeling moiety, and wherein the second purified antibody does not competitively inhibit the 

25 binding of the first purified antibody, and c) determining the presence or amount of the soluble 
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epidermal growth factor receptor complexed with said antibodies by detecting the co-presence of 
the first and second labels. 

In preferred embodiments of the assays of the invention, one antibody is either MAb R.1 
or an antibody which binds to the same epitope as MAb R. 1 . In preferred embodiments, the 
5 second antibody is MAb 528, or an antibody which binds to the same epitope as MAb 528. 
Antibodies may be tested for common epitopes by competitive binding assays according to 
methods standard in the immunochemical arts. Competitive binding assays using pi 70 ErbBl, 
pi 10 sErbBl, or any other ErbBl isoform with ligand binding subdomains I - IV may be used as 
the antigen in a competitive binding assay to determine antibodies for use in the assays of the 
10 invention. 

It is not intended that the above assays be limited to any particular immunochemistry 
format. Thus, standard sandwich, soluble sandwich, and competitive binding formats are 
envisioned for use in the assays of the invention. However, applicants have found that a standard 
sandwich assay format, such as described in Example V, works well, and thus will be used to 

15 generally describe the assay. In this format, the first labeling moiety is a binding moiety, such as 
a hapten or biotin. The first labeled antibody is then bound to a streptavidin or avidin coated 
solid support, such as a microtiter plate well. This serves as a detection mechanism for the first 
labeling moiety, as the location of the moiety (i.e., bound to the well of the microtiter plate) is 
then known. After the first antibody is bound to the wells, a patient biological sample may then 

20 be introduced into the wells. 

In preferred embodiments of this aspect of the invention the patient biological sample is 
chosen form the group consisting of blood, serum, plasma, urine, and ascites fluid. Especially 
preferred samples are serum, and plasma, after incubation with the first antibody, the wells are 
rinsed, and then the second labeled antibody is added. In this format, the second labeling moiety 

25 is a detectable labeling moiety such as a fluorescent, colorigenic, or chemiluminescent moiety. 
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A preferred moiety for use as the second labeling moiety is acridinium, as the applicants have 
found its strong signal useful in determining especially low concentrations (femtomolar range). 
After incubation with the second antibody, the wells are again rinsed. Proper reaction 
components are then added, and the second labeling moiety detected by fluorometry, 
5 colorimetry, or luminometry. Thus, the co-presence of the two labels is determined by their 
location (attached to the well) and their detectable product (fluorescence, light, or colorimetric 
product). 

The invention further provides a diagnostic method for determining the presence, or risk, 
of an ovarian carcinoma in a female human patient. The method comprises a) determining the 

10 concentration of soluble EGFR in a biological sample obtained from a female patient (e.g., by the 
above immunochemical method b) comparing the concentration obtained in a) with a normal or 
baseline level for soluble EGFR established with samples from female humans without ovarian 
cancer, and c) correlating a decrease in the concentration of sEGFR in the patient's sample with 
the presence of an ovarian carcinoma in the patient. A baseline for an assay of the invention may 

1 5 be established from biological samples from healthy patients, as in Example VI. It is desirable to 
establish baseline concentrations for a range of ages and physiological conditions (such as 
pre/post menopause) in order to better match the baseline value to the patient being tested. As 
shown in Example VII, the presence of ovarian cancer in female patients is strongly correlated 
with a reduced serum sErbBl concentration in those patients. Thus, the assays of the invention 

20 may be useful for screening patients for ovarian cancer using simple phlebotomy samples. 

In further embodiments of the assay of the invention, a female patient may be monitored 
with repeated testing to determine the onset or progression of ovarian cancer. Repeated testing 
may be done as a yearly screening of a patient to better detect the onset of disease, or at shorter 
intervals if a patient is at high risk for the disease. Additionally, the female patient may be tested 

25 before and after radiation, chemotherapy, or surgical treatment to monitor the regression or 
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progression of ovarian cancer. This follow-up testing may be done at regular intervals, such as 
monthly or weekly, or at other intervals if indicated by the patient's condition. As shown in 
example VII, the serum levels of female patients with ovarian cancer may change over the course 
of treatment, indicating an improved prognosis. Conversely, a decrease in serum sErbBl levels 
5 may be used to catch remission of the disease, or recurrences after cytoreductive surgery. 

The results presented below show that ovarian cancer patients with survival times twice 
that of the mean survival times of patients with ovarian cancer had detectable levels of sEGFR 
while patients with average, or less than average, survival times had undetectable levels of 
sEGFR. Thus, the absence, or low levels, of sEGFR may be indicative of aggressive disease. 
10 The following examples are offered to further illustrate the various aspects of the present 

invention, and are not meant to limit the invention in any fashion. Based on these examples, and 
the preceding discussion of the embodiments and uses of the invention, several variations of the 
invention will become apparent to one of ordinary skill in the art. Such self-evident alterations 
are also considered to be within the scope of the present invention. 

15 

EXAMPLE I 



Isolation and Characterization of Human c-erbBl cDNAs Encoding Soluble EGFRs 

To isolate human EGFR clones that lack sequences encoding the cytoplasmic domain, but 
20 have the extracellular domain, differential hybridization was employed to screen an oligo-dT 
primed human placental cDNA library (Clontech, cat. # HI 144x). The library was screened for 
clones that were positive for a ligand binding domain (LBD) specific probe (positions 174-2105), 
but negative for a kinase domain (KD) probe (see Figure 1 for full length/ pi 10 comparison). 

The ligand binding domain probe was synthesized by the PCR using pXER as a template 
25 (Chen et al. 9 Nature , 32, 820 (1987)). The forward primer was: SEQ ID NO: 7, corresponding to 
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nucleotide positions 174-193. The reverse primer had the sequence SEQ ID NO: 8, representing 
base pairs 1986-2105. Nucleotide numbering is according to Ullrich et aL, supra, unless stated 
otherwise. Amplification was performed for 35 cycles (94° C for 1 minute; 65° C for 1 minute; 
72° C for 3 minutes) with a final extension at 72° C for 10 minutes. The PCR product was then 
5 excised from a low melting point agarose gel A 768 bp EcoRI fragment from pXER was gel 
purified and used as the intracellular kinase domain (KD) probe. The LBD and KD probes were 
radiolabeled with [oc- 32 P]dCTP using a random primer DNA labeling kit (Gibco BRL) according 
to the manufacturer's instructions. The hybridizations were performed in a solution containing 
6X SSC, 5X Denhard's, 7.5% dextran sulfate, 0.5% N-lauryl sarcosine, and 100 ^g/ml salmon 

10 sperm DNA at 65° C. Filters were washed in 0.1X SSC and 0.1% N-lauryl sarcosine at 65° C 
and then were exposed to x-ray film for 24 to 72 hours at -80° C with an intensifying screen. 

Several clones hybridizing exclusively to the LBD probe were purified. Plasmid DNA, 
which contained inserts of interest, was released from the p^DR2 vector by site-specific 
recombination using the CRE-lox system (Murphy et aL, supra). Inserts were sequenced on both 

15 strands using the Taq DyeDeoxy cycle sequencing kit and the Applied Biosystems model 373 A 
automated DNA sequencer. 

To determine whether the transcript represented by pi 10 cDNA is expressed in human 
placenta, RNA from a human placenta cell line (ATCC, CRL 1584) was isolated by a guanidine 
isothiocyanate procedure. Isolated RNA was treated with RNase free DNase and extracted twice 

20 with 1 : 1 phenohchloroform. RNA (1 |ug) was heated to 90° C for 5 minutes, then the RNA was 
reverse transcribed in a 20 p,l reaction containing IX Avian Myeloblastosis Virus (AMV) 
reaction buffer, 1 MM each dNTP, 10 mM dithiothreitol (DTT), 20 U RNAsin, 10 U AMV 
reverse transcriptase, and 0.1 ^g oligo-dT at 24° C for 10 minutes, 42° C for 50 minutes, 99° C 
for 5 minutes and then 4° C for 5 minutes. The first strand cDNAs were then amplified by 

25 adding Taq polymerase to the reverse transcription reaction along with EXI5F and EX15R in a 
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final volume of 100 p,l under the amplification conditions described hereinabove. The amplified 
products were analyzed by 5% PAGE. The results show that the RNA represented by 3.0 kb 
transcript is expressed in the human placenta cell line. 

Thus, the isolated clone represents a 3.0 kb transcript of EGFR. To map the 3.0 kb 
5 transcript, the following primers were employed: P1981 (EX15F; SEQ ID NO:9), P267F 
(EX15bF; SEQ ID NO: 10), P615F (EX15bF; SEQ ID NO: 11), P297R (EX 15bR; SEQ ID 
NO: 12) and P732R (EX15bR; SEQ ID NO:13). The 3.0kb transcript arises from an alternative 
splicing event from exon 15 to a novel exon located within intron 15 of the EGFR gene. This 
novel exon contains 2 polyadenylation sites. None of the downstream EGFR exons are included 

10 in this transcript. This transcript differs from the 2.8 kb transcript unique to A43 1 cells as the 
A431 transcript contains EGFR exons 1 to 16 and then splices to an unrelated sequence derived 
from a translocation. The 3.0 kb transcript encodes a polypeptide of 681 amino acids (less the 24 
amino acid signal peptide) (pi 10) (SEQ ID NO:l) containing 78 unique carboxy-terminal amino 
acids, amino acids Pro 628- His 705 of SEQ ID NO. 1. 

1 5 Other variant unique EGFR-related sequences that have been detected by either cDNA 

cloning or by PCR include truncation at amino acid Phe 641 (SEQ ID NO:3), Gin to Arg at 
amino acid 657 (SEQ ID NO:4) 5 Pro to Leu at amino acid 661 (SEQ ID NO:5) ? and Ser to Phe at 
amino acid 703 (SEQ ID NO:6). 

Therefore, soluble forms of human EGFR, as well as amino acid variants of these 

20 sEGFRs, are expressed in placental tissue. 

EXAMPLE II 

Soluble Human c-erbBl Gene Product 

25 The amino acid sequence deduced from the 3.0 kb transcript cDNA, (SEQ ID NO:l), 
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predicted a 681 amino acid polypeptide after cleavage of the signal peptide with a molecular 
mass of 77 kD. The first 24 amino acids code for a signal peptide; following cleavage by signal 
peptidases, the predicted molecular weight of this polypeptide is 75 kD. The sequence encodes 
subdomains I, II, III and a portion of subdomain IV of the extracellular ligand binding domain of 
5 the EGFR. A quail fibroblast cell line, QT6, was transiently transfected with the pDR2241, 
which contains the 3.0 kb sErbBl transcript and encodes a 1 10 kD polypeptide (pi 10) that has 
ErbBl ligand binding subdomains I through IV plus an additional 78 unique carboxy-terminal 
amino acids. Cells were transfected with 15 jag of pDR2241 by the calcium phosphate 
precipitation technique as described previously (Ausubel et al. 9 Curren Protocols in Molecular 

10 Biology, John Wiley & Sons, NY (1994)). 

Transfected cells from two 10 cm plates were pooled and replated in 6 well plates 
approximately 48 hours post-transfection. The following day, cells were reused once in 
phosphate buffered saline (PBS) and labeled in methionine free DMEM supplemented with 5% 
dialyzed FCS and 150 gCi/ml of [ 35 S] methionine (Promix, Amersham) at 37° C for 12 hours. 

1 5 Conditioned medium from labeled cells was collected and centrifuged briefly to remove loose 
cells and debris and phenylmethylsulfonyl fluoride (PMSF) and aprotinin were added to a final 
concentration of 1 mM and 50 jug/ml. Cell monolayers were lysed and immunoprecipitated with 
the addition of 1 to 5 jig of monoclonal antibody as described previously (Maihle et al, Mol. 
Cell. Biol., 8, 4868 (1988)). Samples were resuspended in 2X Laemmli sample buffer (125 mM 

20 Tris-HCl, pH 6.8, 4% SDS, 20% glycerol, 10% 2-mercaptoethanol, 2 mM EDTA, 0.04% 

bromphenol blue), boiled for 5 minutes and separated by 10% SDS-PAGE. Gels were stained 
with Coomassie blue, treated with EnHance (Dupont) and dried before an overnight exposure to 
x-ray film. 

Immunoprecipitation of mock transfected cells failed to reveal a specific EGFR related 
25 polypeptide in either cell lysates or in conditioned media, while a 1 15 kD soluble EGFR was 
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immunoprecipitated from the media of control A431 cells. As predicted, immunoprecipitation of 
transfected cells revealed a heterogeneous 1 10 kD species that was specifically recognized by the 
EGFR specific monoclonal antibody, Rl (Amersham, RPN 513) (Waterfield et aL 9 J. Cell. 
Biochem., 20, 149 (1989)). Thus, expression of the nucleic acids of the invention encoding 
5 soluble truncated forms of the human EGFR results in a glycosylated sEGFR protein when 
transgenically expressed in mammalian cells. 



EXAMPLE III 

10 

Growth inhibitory Potential of Soluble c-erbBl Receptors on Ovarian Carcinoma Cell 
Growth in Vitro 

In order to examine the affect of pi 10 sErbBl on EGFR-regulated cell growth, stable CHO 
1 5 cell lines expressing pi 10 and/or EGFR have been established and clonally isolated. Co- 
expression of pi 10 sErbBl in cells expressing pi 70 EGFR results in the rapid and unexpected 
induction of cell rounding (24 hr) and programmed cell death (48 hr). See Figure 5. We have 
verified that the mechanism of cell death is apoptotic based on nuclear morphology, and Hoechst 
staining. Interestingly, cell death does not occur in CHO cells treated similarly but expressing a 
20 truncated EGFR mutant that has lost most of its extracellular domain (i.e., the type III variant, 
originally cloned from a human glioma). These results were initially discovered using transient 
transfection analyses, but since that time inducible expression of pi 10 in CHO cells has been 
established using an ecdysone promoter system (Invitrogen), and have found identical results 
using this inducible system. These results suggest that pi 10 sErbBl may be able to interfere 
25 with EGFR-dependent cell substrate attachments, and that these attachments are dependent on 
the extracellular domain of the EGFR. 

To determine the function of the human 3.0 kb alternative transcript which produces pi 10 
sErbBl, we used a quantitative ribonuclease protection assay was used to determine its relative 
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abundance in RNA from 17 adult (brain, breast, colon, heart, kidney, liver, lung, pancreas, 
placenta, prostate, skeletal muscle, small intestine, spleen, stomach, testis, thymus, and uterus) 
and 4 fetal tissues (brain, kidney, liver, and lung), as well as in numerous carcinoma-derived cell 
lines, either with or without EGFR gene amplification. The riboprobe consisted of 313 nt (1754- 
5 2066 in X00588) shared by both the full-length and the 3.0 kb transcripts, plus 134 bp of exon 
15B which was specific for the 3.0 kb mRNA. Transcripts containing exon 15B would result in 
a protected fragment of 447 nts, whereas those containing exon 15 spliced to exon 16 would 
result in a protected fragment of 3 13 nts. The full-length transcripts were observed in all of the 
samples examined, while the 3.0 kb transcript was detected only in human placenta, and in the 

10 carcinoma cell lines which also contained amplification of the EGFR gene (Figure 7 A). The 
relative ratio of full-length to 3.0 kb transcripts was quantified using a phosphoimager. The 
relative level of the full-length mRNA was -200-fold greater than the level of the 3.0 kb 
transcript in both human placenta and in the MDA-MB-468 breast carcinoma cell line, which 
contains -15-fold amplification of the EGFR gene (Filmus et aL, 1985). However, in the A431 

15 carcinoma cell line containing -30-fold amplification of the EGFR gene, the full-length 
transcript was only present in 100-fold excess compared to the 3.0 kb mRNA. The 3.0 kb 
transcript was not detected in the absence of EGFR gene amplification in other carcinoma- 
derived cell lines. 

Normal cells are expected to tolerate co-expression of pi 10 sErbBl and EGFR because 
20 these proteins route to distinct membranes (i.e., apical vs. basolateral), and that co-localization of 
these receptors will result in apoptosis. Accordingly, the loss of cell polarity/membrane 
organization characteristic of high-grade carcinomas results in selection against pi 10 sErbBl 
expression in ovarian tumors (our preliminary results suggest that this is, indeed, the case). As 
pi 70 ErbBl expression is usually up-regulated in ovarian cancer cells as compared to pi 10 
25 sErbBl, increased exposure to pi 10 sErbBl is expected to prove toxic to these cells. Thus, the 
pi 10 sEGFR may be used to produce apoptosis in ovarian cancer cells: 

Once the integrity of the soluble receptor, produced as described in Example II, has been 
established (i.e., by mobility as a discrete band of the appropriate molecular weight on an SDS 
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gel) serial dilutions of these preparations are added to the culture media (MEM-a reduced FCS, 
+/- recombinant EGF/TGF-a (Collaborative Research)) of selected (c-erbB expressors vs. 
nonexpressors) ovarian carcinoma cells to assay their effect on cell growth. Changes in cell 
growth rates are determined by monitoring 3 H-thymidine incorporation, and by using an MTT 
5 cell growth assay. 

Cultures of ovarian carcinoma cells exposed to sEGFR preparations have reduced growth 
rates compared to cells which are not exposed to sEGFR. Thus, sEGFR can inhibit carcinoma 
cell proliferation. 

EXAMPLE IV 

10 Preparation of Monoclonal Antibodies 

By utilizing polypeptide sequences derived from the unique 78 amino acid carboxy 
terminus of SEQ ID NO. 1, 4, 5, or 6, specific monoclonal antibodies may be produced to SEQ 
ID NO. 1,3,4, 5, or 6 (note: as the unique carboxy terminus of SEQ ID NO. 3 is a portion of 
SEQ ID NO. l's carboxy terminus, antibodies raised using some or all of the first 14 amino acids 

15 of the 78 amino acid carboxy terminal sequence of SEQ ID NO. 1 may react with SEQ ID NO. 
3). Applicants note that the total number of sequences which may be tested for antigenicity if 10 
amino acid polypeptides are used, and sequences are generated by "walking' along the amino 
acid sequence, would be 91 (68 for SEQ ID NO. 1,10 unique sequences for SEQ ID NO. 4, 10 
unique sequences for SEQ ID NO. 5, and 3 unique sequences for SEQ ID NO. 6). This number 

20 is quite manageable, given modern polypeptide production techniques and facilities for mass 
testing of hybridoma candidates, and does not represent undue experimentation for one of 
ordinary skill in the art (predetermined peptide sequences may be ordered on the World Wide 
Web, and sent to available commercial services to generate hybridomas). Thus, monoclonal 
antibodies may be prepared by analogy to the methods described below for the preparation of 

25 subdomain I, III, and IV specific antibodies. 
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To prepare monoclonal antibodies (MAb) specific for epitopes present in domains I, III or 
IV of full-length human EGFR, synthetic peptides were prepared. The peptides were predicted 
to have a high hydrophilicity, surface probability, and antigenicity. The peptides correspond to 
amino acids 77 to 93 of subdomain I (SEQ ID NO: 14), 290-31 1 of subdomain III (SEQ ID 

5 NO: 15), 352-369 of subdomain III (SEQ ID NO: 16), and 556-567 of subdomain IV (SEQ ID 
NO: 17) of the compete ErbBl receptor, which lacks the signal peptide. Peptide immunogens 
were made by coupling the peptides to keyhole limpet hemocyanin and bovine serum albumin. 
The maleimide coupling chemistry required the addition of cysteines to the c-terminus of the 77 
to 93 and 352-369 peptides, while the native cysteines of the 290-31 1 and 556-567 peptides were 

1 0 used for coupling. 

Although every immunized mouse elicited an immunological response toward its cognate 
peptide, as determined by ELISA, only mice injected with peptides having SEQ ID NO: 15, SEQ 
ID NO:16 or SEQ ID NO:17 produced antibodies capable of recognizing the 170 kD ErbBl 
receptor (pi 70) from A431 whole cell ly sates by Western blot. Hybridomas from mice injected 

15 with peptides having amino acids 290 to 31 1 (clone 10B7), 352-369 (clones 15E1 1 and 17H3), 
and 556-567 (clone 2D2) of the mature EGFR were generated. All of these clones produced 
monoclonal antibodies (MAbs) that recognized p 170 ErbBl in Western blots, and that bound to 
A431 cells as detected by immunofluorescence microscopy. Two MAbs (15E1 1 and 2D2) were 
also compatible with immunohistochemical methods, whereby cells or tissues are embedded in 

20 paraffin, sectioned, treated with steam and citrate to retrieve masked antigens, immunolabeled, 
and processed to visualize antigen with diaminobenzidine. 

Three MAbs (10B7, 15E1 1, and 2D2) were able to immunoprecipitate pl70 ErbBl from 
whole cell lysates of A43 1 cells. One of these MAbs (1 5E1 1) immunoprecipitated a 60 kD 
soluble form of human ErbBl (p60) from whole cell lysates and culture media of transfected 

25 QT6 quail fibroblasts (QT6/pDR161). MAb 15E1 1 also detected p60 ErbBl in QT6/pDR161 
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culture media by ALISA (acridinium-linked immunosorbent assay). MAb 15E1 1 was covalently 
coupled to Protein G and this resin was employed to purify (80% homogeneity) p60 ErbBl from 
QT6/pDR161 culture media by immunoaffmity chromatography. This MAb reacted in the same 
manner to pi 10 ErbBl. 

Hybridoma clones 10B7, 15E1 1, 17H3 and 2D2 have been deposited with the American 
Type Culture Collection, in accord with the requirements of the Budapest Treaty, and granted 
Accession Nos. , ? , and , respectively. 
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EXAMPLE V 

ALISA for Detection of the sErbBl isoforms (sEGFRs) in Human Sera and Cell Culture 

5 A sensitive acridinium-linked immunosorbent assay (ALISA) to quantify sErbBl and 

ErbBl molecules in patient body fluids and tissues, respectively, was developed by applicants. 
This ALISA as described below was used to quantify serum sErbBl levels in healthy men and 
women, as further described in Example VI, as well as to demonstrate that serum samples of 
healthy men and women contain a sErbBl analog of approximately 1 10 kD (See Figure 14). The 
10 experimental results of Baron et al, J. Immunol. Methods, 219, 23 (1998) are incorporated by 
reference. The ALISA was developed using the following: 
Antibody reagents and analytes 

Sheep anti-ErbBl antiserum (cat. #06-129) was obtained from Upstate Biotechnology 
(Lake Placid, NY). Antiserum #06-129 was prepared against a recombinant fusion protein that 

15 embodies exons 15-18. Because exon 17 encodes the transmembrane domain, this protein 

embodies extracellular, transmembrane, and cytoplasmic amino acid sequences. Mouse MAbs 
specific for ErbBl ECD were obtained from various commercial sources: MAb R.l (Amersham 
Life Science, Arlington Heights, IL; Oncogene Research Products, Cambridge, MA; Santa Cruz 
Biotechnology, Santa Cruz, CA), MAb CI 1 (Cambridge Research Biochemicals, Valley Stream, 

20 NY), MAb 528 (Oncogene Research Products; Santa Cruz Biotechnology), MAb 225 (Oncogene 
Research Products), MAb LAI (Upstate Biotechnology), MAb LA22 (Upstate Biotechnology), 
MAb 111.6(NeoMarkers,Freemont,CA),MAb 199.12 (NeoMarkers). MAbs 10B7, 15E11, 
17H3, and 2D2 disclosed above are specific toward peptide epitopes of ErbBl ECD. MAb 20H5 
(Sanders and Salisbury, 1994), which is specific for the cytoskeletal protein centrin, was used as 

25 a control antibody. 



-43 - 



Patent 
257/225 



Purified pi 70 ErbBl, which is used as our standard analyte, was purchased from Sigma 
(St. Louis, MO). The protein concentration of pi 70 ErbBl given by Sigma was confirmed by 
quantitative amino acid analysis. Briefly, pi 70 ErbBl was transferred to 6 X 50 mm borosilicate 
tubes, dried, placed in a larger vial with 200 \il of 6 N HC1 plus 1% phenol for hydrolysis under 
5 vacuum for 24 hours. The hydrolyzed amino acids were injected onto a Beckman 6300 Amino 
Acid Analyzer using sodium buffers (Beckman Instruments, Fullerton, CA). Chromatographic 
data were collected and analyzed with Beckman System Gold software, using norleucine as an 
internal amino acid standard. The concentration of pi 70 ErbBl was determined by total 
recovery of alanine, phenylalanine, and proline in comparison to norleucine. The control analyte, 
10 trpEcentrin, was purified and concentration determined by Bicinchoninic Acid Assay (BCA) 
(Pierce Chemical, Rockford, IL). 

Generation ofQt6/psErbBlECD589 

A cDNA fragment encoding 589 amino acids of the human ErbBl receptor's ECD was 
15 subcloned from pXER (Chen et ah, 1989; a.k.a. pXEGFR, Opresko et a/., 1995) into pcDNA3 
(Invitrogen, Carlsbad, CA). pXER was digested with Nae I; an approximately 3.5 kb cDNA 
fragment was resolved by agarose gel electrophoresis and gel purified using GENECLEAN™ 
(BIO 101, La Jolla, CA). The 5'- Nae I restriction enzyme site was located within the pXER 
vector, whereas the 3 1 - Nae I site was positioned just before nucleotide 2026 of the c-erbBl 
20 cDNA; the nucleotide numbering system is that of Ullrich et al (1984). Xba I linkers 

(Boehringer Mannheim, Indianapolis, IN) were phosphorylated with T4 polynucleotide kinase 
and ligated to the blunt-ended approximately 3.5 kb Nae I restriction fragment with T4 DNA 
ligase. The linker-ligated restriction fragment was subsequently digested with Xba I to yield two 
DNA fragments of approximately 1.5 kb and approximately 2.0 kb. The larger approximately 
25 2.0 kb c-erbBl cDNA fragment encoding the ECD of ErbBl was purified as described above and 
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ligated to Xba I digested, calf intestine phosphatase treated pcDNA3. E. coli DH5- were 
transformed with pcDNA3 constructs containing the approximately 2.0 kb c-erbBl cDNA 
according to the method of Hanahan (Maniatis et al 9 1982) and grown on Luria-Bertani agar 
plates containing 100 fig/ml ampicillin as a selectable marker. Plasmid DNA was isolated by the 
5 boiling method from transformed bacterial colonies (Ausubel et al, 1989) and digested with 
BamH I to determine the orientation of the approximately 2 kb c-erbBl insert in pcDNA3. 
Plasmids containing the approximately 2 kb c-erbBl fragment in the sense orientation yield a 
BamH I restriction fragment of 1348 bp, whereas clones containing the approximately 2 kb o 
erbBl fragment in the antisense orientation yield a 678 bp restriction fragment. A transformed 

10 clone of E. coli DH5- containing the approximately 2.0 kb c-erbBl fragment in the sense 

orientation was identified and plasmid DNA, called psErbBlECD589, was prepared using the 
QIAGEN plasmid purification kit (QIAGEN, Chatsworth, CA) for transfection experiments. The 
quail fibroblast cell line, Qt6, was subsequently transfected with psErbBlECD589 by calcium 
phosphate precipitation (Wigler et a/., 1979) and stable, geneticin (648) resistant cells were 

15 isolated and cloned by limiting dilution. A clonal cell line, Qt6/psErbBlECD589, expressing 
plOO sErbBl was identified by immunoprecipitation of 35 S-labeled cell lysates with MAb Cll 
using methods described previously (Maihle et aL, 1988). 

Cell culture 

20 All tissue culture cells were grown in NaHC0 3 , buffered media at 37° C in 5% C0 2 and 

air. All tissue culture reagents were purchased from Gibco BRL Life Technologies (Grand 
Island, NY). A431, MDA-MB-453, and SK-BR-3 cells (ATCC) were maintained in Dulbecco's 
Modified Eagle Medium (DMEM) supplemented with 20 mM Hepes, pH 7.3, 10% fetal bovine 
serum (FBS), 1 mM sodium pyruvate, and 2 mM L-glutamine. Qt6 quail fibroblasts (Moscovici 

25 et ah, 1977; ATCC) and Qt6 cells transfected with the plasmid vectors pDR161 or 
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psErbBlECD589, i.e., Qt6/pDR161 (Reiter and Maihle, 1996) or Qt6/psErbBlECD589, respec- 
tively, were grown in DMEM supplemented with 20 mM Hepes, pH 7.3, 4% heat inactivated 
FBS, 1% chicken serum, 1 mM sodium pyruvate, and 2 mM L-glutamine. 

5 Generation of whole cell lysates for ALISA 

Tissue culture cells at 80% to 90% confluence were rinsed once with phosphate buffered 
saline (PBS; 10 mM KH 2 P0 4 /K 2 HP0 4 , 150 mM NaCl, pH 7.2), scraped from the PBS loaded 
petri dish with a cell lifter (Costar, Cambridge, MA), and harvested by centrifugation at 
approximately 1000X g for 5 minutes. The cell pellet was resuspended and lysed by adding a 

10 1:10 (w/v) ratio of membrane protein immunoprecipitation lysis buffer containing protease 

inhibitors (10 mM Trizma®, pH 7.4,150 mM NaCl, 1 mM EDTA, 1% Nonidet P-40, 150 jig/ml 
phenylmethyl sulfonyl fluoride, 2 ^ig/ml aprotinin, 0.5 (ig/ml leupeptin, 1 \xg/ml pepstatin A). 
The cell lysate was vigorously vortexed for 30 seconds and sonicated (three 10 second bursts at 
half power with 60 second cooling periods) with a sonicator (model W-225R; Heat Systems- 

15 Ultrasonics, Farmingdale, NY) to break DNA molecules. Enough 5.0 M NaCl was added to the 
cell lysate to bring the final NaCl concentration to 500 mM. Cellular debris was pelleted by 
centrifugation at 10,000X g for 10 minutes. The resulting supernatant was exchanged into 
Trizma buffered saline (TBS; 10 mM Trizma, pH 7.4, 150 mM NaCl) containing 0.02% NaN 3 by 
passage through a Sephadex G-25 fast desalting gel filtration column using a Fast Performance 

20 Liquid Chromatography system (FPLCO; Pharmacia Biotech, Piscataway, NJ), and concentrated 
by ultrafiltration with a Centricon centrifugal concentrator (Amicon, Beverly, MA). The protein 
concentration of the final whole cell lysate was determined by BCA (Pierce Chemical). 

Acridinium labeling procedure 

25 A 1 mg/ml stock solution of succinimidyl-activated acridinium ester [4-(2-succinimidyl- 
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xycarbonylethyl)phenyl-10-acridinium-9-carboxylate sulfonate; ASSAY Designs, Ann Arbor, 
MI) in dry dimethyl formamide was stored in 5 jig aliquots at -70° C. MAb IgG was exchanged 
from carrier solution into labeling buffer (0.2 M sodium phosphate buffer (NaH 2 P04/Na 2 HP04) 
pH 8.0) with a Sephadex G-25 fast desalting gel filtration column by FPLC and concentrated by 

5 ultrafiltration to 200 \il and approximately 1.0 mg/ml total protein. MAb IgG was labeled at 
room temperature with a 1:80 molar ratio of IgG to succinimidyl-activated acridinium ester for 
15 minutes in the dark. The coupling reaction was stopped by adding 100 \il of quenching buffer 
(labeling buffer with 10 mg/ml lysine monohydrochloride) and incubating for an additional 5 
minutes. Unbound acridinium ester was removed with a Sephadex® G-25 fast desalting gel 

10 filtration column by FPLCO and simultaneously exchanged into a solution containing 0.2 M 
sodium phosphate buffer, pH 7.3, and 0.02% NaN 3 . Following buffer exchange by FPLCO, the 
acridinium-labeled MAb IgG was concentrated by ultrafiltration to a volume of 100 nl; 1.0 ml of 
storage buffer (0.2 M sodium phosphate buffer, pH 7.3, 0.1% bovine serum albumin, 0.02% 
NaN 3 ) was added and the volume was reduced again to 100 The Relative Light Units (RLU)/ 

15 jlxI were determined and the final acridinium-labeled MAb IgG was stored at -70° C. 

ErbBl ECD-specific acridinium-linked immunosorbent assay 

White XENOBIND® 96 well microtiter plates (Xenopore, Saddle Brook, NJ) were 
coated overnight at 4° C with 25 jig/well of an affinity-purified goat anti-mouse IgG 2b specific 

20 polyclonal antibody in carbonate buffer (90 mM NaHC0 3 ) 1 0 mM Na 2 C0 3 , pH 9.4, 0.02% 
NaN 3 ); the antigen is attached covalently to the plate under these conditions. All incubations 
were performed on a rocker platform. The plates were washed three times with high salt Tween- 
20® wash buffer (HST-20WB; 0.05% polyoxyethylene sorbitan monolaurate [Tween-20®, 20 
mM Trizma®, pH 7.4, 500 mM NaCl, 0.02% NaN 3 ) blocked with ALISA blocking buffer 

25 (ALBB; 2.0% bovine serum albumin (BSA), 10 mM Trizma pH 7.4, 150 mM NaCl, 0.02% 
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NaN 3 ) for 1 hour at room temperature, washed with HST-20WB, incubated with 0.05 p,g/well 
anti-ErbBl ECD-specific MAb R.1 diluted in ALBB for 2 hours at 37° C, washed three times 
with HST-20WB, incubated with analyte or unknown sample for 2 h at 37° C ? washed three 
times with HST-20WB, incubated with acridinium-labeled anti-ErbBl ECD-specific MAb 528 
5 (500,000 counts/well) for 1 hour at 37° C, washed three times with HST-20WB, and read with a 
luminometer (model LB 96P; EG&G Berthold Analytical Instruments, Nashua, NH). In order to 
maintain an even temperature across the microtiter plate, all 37° C incubations were performed in 
a forced-air environmental shaker (model 3528; Lab-Line Instruments, Melrose Park, IL) that 
was modified to hold microtiter plates. Acridinium decomposition was initiated by sequentially 

10 adding a solution containing 0.441% nitric acid and 0.495% H 2 0 2 followed by a solution 

containing 0.25 M NaOH and 0.1875% cetyltrimethylammonium chloride. This treatment drives 
the acridinium ester to form an unstable dioxetanone intermediate, which decomposes to form N- 
methylacridone in its excited singlet state; relaxation to the ground state results in the emission of 
photons of light at a wavelength of 430 nm (Weeks et al. 9 1986). 

15 Positive and negative control analytes were pi 70 ErbBl (Sigma) and a 60 kD trpEcentrin 

recombinant fusion protein (Baron et al. 9 1992), respectively. Analytes were prepared as dilution 
series in ALBB. Unknown samples included whole cell ly sates, conditioned culture media, and 
human serum samples. All human sera were assayed at a 1 : 1 0 dilution in ALBB. Serial 
dilutions of whole cell lysates and conditioned culture media were made in ALBB. All assays 

20 were partially automated with a BIOMEK 1000 laboratory work station (Beckman Instruments, 
Palo Alto, CA). 

ALSIA results 

As shown in Figure 7, the above ALISA is specific for molecules containing subdomain 
25 IV of the extracellular domain of ErbB 1 (p 1 70 ErbB 1 , A43 1 mutant p 1 1 0 sErbB 1 , recombinant 
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pi 00 sErbBl), but does not bind to molecules which do not contain the subdomain (p60 sErbBl 
and other unrelated molecules). Also, as demonstrated in this and the following examples, the 
ALISA is highly sensitive, and can detect sErbBl in the femtomolar range. 

5 Immunoprecipitation of sErbBl analogs from normal human sera and Characterization 
Blood from healthy men and women was collected by Mayo Medical Laboratories 
(MML), Department of Laboratory Medicine and Pathology, in accordance with an Institutional 
Review Board approved Normal Values Study Program. Each healthy donor was required to 
provide a recent physical history that included a physical exam and the results of the following 

10 tests: Hematology Group, Chemistry Group, Lipids, Thyroid Function, and Urinalysis. Chest x- 
ray and electrocardiogram also were performed on age appropriate subjects. Detailed clinical 
records from these healthy subjects are available. Blood was allowed to clot at room temperature 
for 30 min. The serum was separated from the clot and cells by centrifugation at 2000X g for 10 
minutes, divided into 1 ml aliquots, and stored at -70° C. All serum samples from MML were 

15 collected between 1981 and 1984. Each serum sample was thawed after transfer to our 

laboratory, aliquoted into smaller volumes, and refrozen at -70° C to prevent sErbBl degradation. 
Each serum sample was, therefore, frozen and thawed only twice before measurement by 
ALISA. 

Normal human sera were cleared of lipids with Seroclear® according to the 
20 manufacturer's protocol (Calbiochem-Novabiochem, La Jo 11a, CA), diluted 1 :5 (v/v) in Affi-Gel 
Protein-A MAPs II® binding buffer (Bio-Rad Laboratories, Hercules, CA) and clarified of 
human IgG molecules by passage through a Protein-G Superose® affinity column by FPLC® 
using Affi-Gel Protein-A MAPs II® binding, elution, and regeneration buffers (Bio-Rad 
Laboratories). The column flow through was collected, concentrated by ultrafiltration, 
25 exchanged into TBS containing 0.02% NaN 3 by FPLC® with a Sephadex® G-25 fast desalting 
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gel filtration column, divided into four aliquots, and immediately incubated with uncoupled 
(MAb minus) or MAb R.1-, 225-, and 528-coupled affinity resins. Alternatively, human serum 
samples were simply clarified of human IgG molecules by incubating with Immunopure® 
Immobilized Protein-G resin (Pierce Chemical) for 30 minutes at room temperature prior to 
5 immunoprecipitation. Both methods yielded identical immunoprecipitation results. The MAb 
R.l, 225, and 528 affinity resins were prepared with an Immunopure® Immobilized Protein-G 
IgG Orientation kit according to the manufacturer's instructions (Pierce Chemical); the MAb 
minus resin was prepared in an identical manner, except that no IgG was bound to Protein-G. 
Bound proteins were eluted from these resins with MAPs II® elution buffer, adjusted to neutral 
10 pH by adding 1.0 M Trizma® buffer (pH 9.0), and analyzed by SDS-PAGE and Western 
immunoblot. 

SDS-PAGE was performed in 1.0 mm thick vertical slab minigels (7X8 cm) with a 
Mini-PROTEAN II apparatus (Bio-Rad Laboratories). The stacking gel contained 3% 
acrylamide and the resolving gel contained 10% acrylamide (Laemmli, 1970). All SDS-PAGE 

15 samples were prepared with 4X Laemmli sample buffer (250 mM Trizma®, pH 6.8, 8% SDS, 
40% glycerol, 20% B-mercaptoethanol, 4 mM ethylenediaminetetraacetic acid, 0.08% 
bromphenol Blue) and boiled for 1 minute prior to electrophoresis. Gels were processed for 
Western immunoblot analyses. 

Following SDS-PAGE, proteins were transferred to Immobilon® polyvinylidene 

20 difluoride (PVDF) membrane (Millipore, Bedford, MA) using a Type 1, Milliblot™ Graphite 
Electroblotter System (Millipore) according to the manufacturer's protocol. The membrane was 
allowed to air dry completely following protein transfer. The dry membrane was wet again with 
methanol, washed three times with Tween-20® wash buffer (T-20WB; 10 mM Trizma pH 7.4, 
0.05% Tween-20®, 150 mM NaCl), and blocked with Nonfat Dry Milk (NFDM; 5% nonfat dry 

25 milk, 10 mM Trizma , pH 7.4, 150 mM NaCl, 0.02% NaN 3 ) at 4° C overnight. NFDM blocking 
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was followed by a second blocking step with 0.5% Boehringer Blocking Reagent (Boehringer 
Mannheim) in TBS with 0.02% NaN 3 at 37° C for 1 hour. The membrane was rinsed briefly 
with T-20WB, incubated with primary antibody reagent (mixture of MAbs 15E1 1, 2D2, LA22, 
and Cll or MAbs 10B7, 15E11, 17H3, 2D2, LA22, Cll, 11L6, and 199.12 alone) at 37° C for 1 
5 hour, washed with T-20WB for 1 hour, and incubated at 37° C for 1 hour with peroxidase- 
conjugated goat anti-mouse antibody. Primary antibody reagents consisted either of neat 
conditioned culture media for MAbs 10B7, 15E1 1, 17H3, and 2D2 or of purified IgGs for MAbs 
LA22, CI 1, 1 1 1.6, and 199.12; each purified MAb was used at a final concentration of 
approximately 1 jug/ml. Conditioned culture media containing MAb consisted of RPMI-1640 

10 medium supplemented with 10% or 20% FBS, 20 mM Hepes, pH 7.3, 2 mM L-glutamine, 1 mM 
sodium pyruvate, and 0.01% thimerosal (mercury [(o-carboxyphenyl)thio]-ethyl sodium) as a 
preservative. In some experiments, the membrane was washed with T-20WB and clamped into a 
multichannel Miniblotter™ (Immunetics, Cambridge, MA). Different channels were incubated 
with different primary antibody reagents at 37° C for 1 hour, washed with T-20WB for 1 hour, 

15 and incubated at 37° C for 1 hour with peroxidase-conjugated secondary antibody. To visualize 
antibody binding, the membrane was washed with T-20WB for 1 hour and reacted with the 
enhanced chemiluminescent substrate luminol (Amersham Life Science). 

As shown in Figure 14, a 1 10 kD protein was immunoprecipitated from normal human 
male and female sera with the ECD-specific anti ErbBl antibodies used in the above ALISA, 

20 believed to be the same 1 10 kD protein isolated from a human placental cDNA library as 

described above and is comprised of the 1 1 0 kD p 1 1 0 sErbB 1 SEQ ID NO. 2, and its variants. 

Comparison to Commercially Available ErbBl ECD-specific enzyme-linked immunosorbent 
assay (ELISA) 

25 The EGFR ELISA (catalog #QIA35 from Oncogene Research Products) was performed 
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exactly according to the manufacturer's instructions; i.e., standards and serum samples were 
diluted either into standard or sample diluent, respectively. This ELIS A uses two MAbs that are 
specific for epitopes of the ErbBl receptor ECD in a sandwich configuration; the origin and 
characterization of these MAbs has not been disclosed by the manufacturer. The second MAb 
5 used in the ELIS A is biotinylated and binds peroxidase-conjugated streptavidin. Detection of 
ErbBl is achieved by conversion of the chromogenic substrate tetra-methylbenzidine to a blue 
product by peroxidase, which is subsequently converted to a yellow product by the addition of 
stop solution (2.5 N sulfuric acid). The absorbance of the yellow reaction product was quantified 
with the spectrophotometry tool of a BIOMEK 1000 laboratory work station at 415 nm; a 560 

10 nm reference filter was used for background subtraction. 

Results of these experiments are shown in Figure 15. Although the Oncogene ELIS A did 
react with some substance in the sera of healthy men and women, it did not detect any difference 
in sErbBl concentration between the two, nor did the ELIS A results correlate with the 
applicants' ALISA results. Thus, the commercially available ELIS A test appears to be less 

1 5 sensitive and accurate than the ALISA described herein. 



EXAMPLE VI 

Establishment of Baseline sErbBl Concentrations in Normal Male and Female Subjects 

In order to increase the usefulness of the above ALISA for determining abnormal sErbBl 
20 values in patient sera, baseline or normal serum sEGFR concentration values for men and women 
over a broad age range were established using the following methods: 

Serum Samples 

Blood from 88 healthy men and 144 healthy women was collected in accordance with a 
25 Mayo Foundation institutional review board-approved "Normal Values Study." Briefly, all 
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blood samples were processed into serum and stored at -70°C until they were used. Serum 
sErbBl levels were quantified for all the available samples. The gonadotropic and steroid 
hormone levels were measured in 83 of the 88 healthy men, and 123 of the 144 healthy women. 

5 Menopausal Status Determination 

Medical records were reviewed systematically to ascertain menopausal status at the time 
of the blood draw. Data collected include: date and patient age at the time of blood draw, date of 
last menstrual period (LMP), self-reported symptoms of menopause, date of surgical menopause 
(hysterectomy and/or oophorectomy), and FSH and/or LH levels. For patients who underwent 

10 surgical hysterectomy but not oophorectomy, the clinical records also were reviewed to 

determine the approximate date of clinical menopause (patient's self-report). The criteria used to 
assign menopausal status were one or more of the following: 1) age > 00 years, 2) last reported 
menstrual period > six months from the date of the blood draw, 3) symptoms of menopause, 4) 
hysterectomy, 5) oophorectomy, 6) FSH level < or > 00 IU/L, and LH level < or > 00 IU/L. 

15 Menopausal status for 6 of the 144 women could not be determined in this study, because a) FSH 
levels were equivocal or there was insufficient serum to measure these levels, b) the patient's 
medical record was incomplete; i.e., a long time interval between the patient's last medical visit 
and the blood draw or the patient transferred health care to another facility). 

20 pi 10 sErbBl ALISA 

Serum pi 10 sErbBl levels were determined by ALISA as outlined above in Example V. 
This ALISA specifically detects pi 10 sErbBl, and does not cross-react with p60 sErbBl, pi 05 
sErbB2, or full-length ErbB2, ErbB3, or ErbB4. Initially, all sera were diluted 1 : 10 in ALISA 
blocking buffer (ALBB) and assayed in duplicate in three separate trials. Serum samples 
25 yielding relative light units (RLU's) below the linear range of the assay's standard curve were re- 
assayed either undiluted or diluted 1:5 in ALBB, whereas serum samples yielding RLU's above 
the linear range of the assay's standard curve were re-assayed either diluted 1 :20 or 1 :50 in 
ALBB. For each trial, the mean RLU's for each duplicate was determined and a corresponding 
sErbBl concentration in finol/ml was calculated. The final pi 10 sErbBl concentration reported 
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here, for each serum sample, is the median value from all three trials. The inter-assay biological 
detection limit for the pi 10 sErbBl ALISAs performed was 10 finol/ml. 

Serum sErbBl Levels Differ when Adjusting for Gender, Age, and Menopausal Status 

5 When unadjusted for age, serum pi 10 sErbBl levels do not differ significantly between 

healthy men and women ranging in age from 20 to 79 years (See Figure 8). We report a median 
serum pllO sErbBl level of 6,816 finol/ml (range: 837-42,533 finol/ml) and 7,177 finol/ml 
(range: 1 14-31,465 finol/ml) for the men and women in this report, respectively. However, when 
taking age into consideration, we observe significant negative and positive associations between 

10 serum pi 10 sErbBl levels and age in these healthy men (Spearman rank correlation coefficient = 
0.4562, p=0.0001) and women (Spearman rank correlation coefficient = -0.3491, p=0.0001), 
respectively (See Figure 9 A & B). Further analysis of sErbBl levels with regard to menopausal 
status show that premenopausal and postmenopausal women have median serum pi 10 sErbBl 
levels of 8,561 finol/ml (range: 341-24,294 finol/ml) and 3,400 finol/ml (range: 114-31,465 

15 finol/ml), respectively (See Figure 10). These data show that pi 10 sErbBl levels are 

significantly higher in premenopausal women than postmenopausal women (pO.OOOl). To 
determine if this difference is simply related to age, we next attempted to age match (+ 1 year) 
men to these groups of premenopausal and postmenopausal women. Although we were able to 
age match younger men to a subgroup of 71 premenopausal women, we were unable to age 

20 match enough older men to the postmenopausal group of women. We report a median serum 
pi 10 sErbBl level of 8,740 finol/ml (range: 747-24,294 finol/ml) for the subgroup of 
premenopausal women and 5,883 finol/ml (range: 837-28,602 finol/ml) for the age-matched men 
(See Figure 1 1). Interestingly, the serum pi 10 sErbBl levels of the premenopausal women is 
also significantly higher than that of the younger men (p=0.009). These observations indicate 

25 that gender differences in serum sErbBl levels exist between younger men and women, as well 
as between men and women 40 years and older. Moreover, these data suggest that the observed 
differences in sErbBl levels between premenopausal and postmenopausal women are not simply 
a function of age, but may be modulated by the milieu of circulating gonadotropic and sex 
steroid hormones. 
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EXAMPLE VII 

Ovarian Cancer Patients Exhibit Significantly Reduced Serum sErbBl Concentrations 

5 To explore the diagnostic potential of sErbB 1 , applicants utilized the ALIS A described in 

Example V to compare pre-operative serum sErbBl levels between 149 healthy women, 164 
stage I, II, III, or IV ovarian cancer patients, 142 patients with benign ovarian tumors, and 115 
patients with other benign gynecologic diseases of the pelvis. The individuals in each of these 
cohorts are within the same age range, but are not age-matched on a one-to-one basis. The 

10 cohort of patients with benign ovarian tumors had the following diagnoses: simple cyst, corpus 
luteum cyst, follicular cyst, dermoid, fibroma, mucinous cystadenoma, and serous cystadenoma; 
and the cohort of patients with other benign gynecologic pelvic diseases had the following 
diagnoses: paratubal cyst, cervical dysplasia, endometriosis, fibroids, and hydrosalpinx. This 
scattergram in Figure 12 clearly shows that serum sErbBl levels in EOC patients are 

15 significantly lower than those seen in healthy women (non-surgical patients), in patients with 
benign ovarian tumors, and in patients with other benign gynecologic pelvic diseases. Moreover, 
these data indicate that pre-operative serum sErbBl levels may be useful in making a diagnosis 
between early, as well as late stage EOC versus benign ovarian tumor and other benign 
gynecologic pelvic disease. 

20 

EXAMPLE VIII 

Post-operative Serum sErbBl Concentrations of Ovarian Cancer Patients are Altered as 

compared to Pre-Operative Concentrations 

The serum sErbBl levels of ovarian cancer patients was tracked using the ALIS A 
25 described in Example V according to the following methods: 

Sample Collection and ALIS A Methods 

Blood from healthy women was collected by the Department of Laboratory Medicine & 
30 Pathology, Mayo Medical Laboratories, in accordance with an ongoing institutional review 
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board-approved Normal Values Study program and processed into serum. All serum samples 
from healthy women used in this study were collected between 1981 and 1984. 

Between 1985 and 1994, serum samples from women presenting to the Mayo Clinic for 
gynecological surgery were collected and stored to study the reproducibility of C A- 125 
5 measurements in women with EOC (60-63). Patients with ovarian cancer were classified as 
having International Federation of Gynecology and Obstetrics stage I, II, III, or IV disease at the 
time of staging laparotomy and tumor reductive surgery. Serum samples were considered 
preoperative if they were collected within 30 days prior to surgery. Patients with a prior 
diagnosis of EOC that had received previous cytoreductive surgery, radiation, or chemotherapy 

10 were eliminated from our study. We identified serum samples from 21 patients, ranging in age 
from 15 to 83 years, that fit these criteria. 

Postoperative serum samples from patients with stage III or IV EOC were collected in 
accordance with North Central Cancer Treatment Group and Mayo Clinic Protocol 90-61-54, 
entitled "Cyclophosphamide plus carboplatin: comparison of conventional dose and double-dose 

15 carboplatin in patients with stage III or IV ovarian carcinoma - a Phase III study." All serum 
samples were collected between 1992 and 1994. Seventy-nine eligible patients were randomized 
to treatment on this study within 1 month after staging laparotomy and cytoreductive surgery. 

Following collection, all blood samples were allowed to clot at room temperature for 30 
min. The serum was separated from the clot and cells by centrifugation at 2000 ~ X g for 10 

20 min. divided into 1-ml aliquots, and stored at -70°C. Each serum sample was thawed after 
transfer into our laboratory, aliquoted into smaller volumes, and refrozen at -70°C to prevent 
sErbBl and EGF degradation. Each serum sample was, therefore, frozen and thawed only twice. 

Serum sErbBl levels were determined with an ALISA specific for epitopes of the ECD of 
ErbBl according to Example V with the following ALISA blocking buffer (1.0% BSA, 10mM 

25 Trizma, pH 7.4, 150 mM NaCl, 0.01% normal rabbit serum, 0.01% normal mouse serum, 0.02% 
NaN 3 ). Human sera were assayed undiluted or at dilutions of either 1 :25 or 1 : 10 in ALISA 
blocking buffer. Initially, each serum sample was tested in duplicate at a 1 :25 dilution in three 
separate experiments. Each serum sample was then tested in duplicate at a 1:10 dilution in three 
separate experiments. Finally, those serum samples that yielded undetectable sErbBl levels at 
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dilutions of 1:25 and 1:10 were tested undiluted in duplicate in three separate experiments. 
Undiluted serum samples that yielded values in relative light units below the interassay 
biological detection limit of 24 fmol/ml for this ALISA were considered undetectable. The 
sErbBl concentration reported in the scattergrams for each serum sample represents the median 
5 of the mean sErbBl level determined in three separate assays. 

Results in Pre-operative vs. Post-operative samples 

Serum samples collected within a period of 30 days prior to staging laparotomy and 
cytoreductive surgery from 21 stage III or IV EOC patients were identified; none of these 

10 patients had received prior chemotherapy, radiation, or debulking surgery. We compared the 
serum sErbBl levels in these EOC patients with the serum sErbBl levels in a group of 21 
healthy women of similar ages. The median (range) serum sErbBl concentration of the 21 age- 
matched healthy women is 6,395 fmol/ml (1,846-23,708 fmol/ml). In contrast, the median 
(range) preoperative serum sErbBl concentration of the 21 patients with stage III or IV EOC is 

15 284 fmol/ml (30-1,350 fmol/ml). These data indicate that preoperative serum sErbBl levels in 
patients with stage III or IV EOC are significantly lower than serum sErbBl levels in healthy 
women of similar ages (Wilcoxon rank sum test, P < 0.0001). 

Serum samples collected after staging laparotomy and cytoreductive surgery were also 
examined from 73 patients with stage III or IV EOC who presented for treatment between 1992 

20 and 1994. These patients had not received prior debulking surgery, radiation, or chemotherapy 
for EOC, and were enrolled in a phase III randomized clinical trial to study the efficacy of 
cyclophosphamide plus conventional dose carboplatin versus cyclophosphamide plus an 
intensive dose of carboplatin in patients with stage III and IV EOC following surgery. 

The initial postoperative (0-34 days) serum sErbB 1 levels in these 73 EOC patients were 

25 compared to serum sErbBl levels in a group of 73 healthy age-matched women (Figure 13). The 
median (range) serum sErbBl concentration of the 73 healthy women was 6,1 13 fmol/ml (1,292- 
51,358 fmol/ml). In contrast, the median (range) initial postoperative serum sErbBl 
concentration of the 73 EOC patients was 1,799 fmol/ml (nondetectable to 1 1,035 fmol/ml). 
These data indicate that the initial postoperative sErbBl levels in patients with stage III or IV 
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EOC differ significantly from sErbBl levels in an age-matched group of healthy women 

Thirty-three patients enrolled in the aforementioned phase III study provided a second 
serum sample 35-287 days after cytoreductive surgery. The median (range) serum sErbBl 
concentration of these 33 serum samples was 6,434 fmol/ml (non-detectable to 29,666 finol/ml). 
5 The median (range) serum sErbBl concentration of these 33 serum samples appeared similar to 
that seen in healthy women, with the exception of one patient who had an undetectable level of 
serum sErbBl (See Figure 13). It is noteworthy that the median serum sErbBl concentrations 
for both the initial and second postoperative serum samples appear higher than those seen in 
preoperative serum samples of patients with stage III or IV EOC. Statistical comparisons of the 

10 sErbB levels in the serum samples collected 0-34 days after cytoreductive surgery with those in 
the serum samples collected 35-287 days after cytoreductive surgery in a group of healthy 
women were not performed, because these 33 serum samples represent a subset of the 73 patients 
enrolled in the phase III study and they were collected over a protracted time period, i.e., 253 
days. Examination of the sErbBl concentration in the initial versus the second serum sample for 

15 each of the 33 patients who underwent cytoreductive surgery shows that sErbBl levels increased 
temporally for many, but not for all of these 33 patients during the course of combination 
chemotherapy. The sErbBl level of one patient in fact decreased below detectable levels. 

These results indicate the ALISA of Example V may be an important tool for monitoring 
the progression or regression of ovarian cancer in patients who are undergoing surgical, 

20 radiation, or chemotherapy treatments. The ability to monitor patient progress using sErbBl as a 
marker may be especially useful in detecting post-surgical recurrence of the disease. 

EXAMPLE IX 

Research suggests that pi 10 sErbBl is a glycosylphosphatidylinositol-anchored protein. 

Research suggests that pi 10 sErbBl is localized to the membrane via the addition of a 
25 glycosylphosphatidylinositol (GPI) anchor. The core structure common to all GPI anchors 
consists of a glycan bridge between phosphatidylinositol and phosphoethanolamine with 
phosphoethanolamine attached to the carboxy-terminus of the protein. GPI anchors are added to 
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proteins containing an appropriate carboxy-terminal signal sequence. This linkage is a post- 
translational modification that occurs in the lumen of the endoplasmic reticulum within minutes 
of protein synthesis. The carboxy-terminal signal sequence consists of a cleavage/attachment 
site, called the omega site (u), a short hinge region that contains charged amino acids, and a 
5 carboxy-terminal hydrophobic region of varying length. The carboxy-terminal signal sequences 
of several known GPI proteins, as well as the predicted signal sequences present in human pi 10 
sErbBl (and in an analogous mouse pi 10 sErbBl) are listed in Table 4. Compared to known 
GPI signal sequences, both the human (SEQ ID NO: 18; SEQ ID NO: 19) and the murine (SEQ 
ID NO: 20) sErbBl receptors contain potential carboxy-terminal GPI anchor signal sequences. In 
10 fact, the human pi 10 sErbBl product contains 2 putative signal sequences; the significance of 
tandem signal sequences is not known, but potentially either could be used. 



There will be various modifications, improvements, and applications of the disclosed 
invention that will be apparent to those of skill in the art, and the present application 
15 encompasses such embodiments to the extent allowed by law. Although the present invention 
has been described in the context of certain preferred embodiments, the full scope of the 
invention is not so limited, but is in accord with the scope of the following claims. 
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SEQUENCE LISTING 

5 <110>Maihle,Nita 
Reiter, Jill 
Baron, Andre 

10 

<120> SOLUBLE EPIDERMAL GROWTH FACTOR RECEPTOR-LIKE PROTEINS AND 
THEIR USES IN CANCER DETECTION METHODS 

15 

<130> Atairgin 257/225 

20 <150> 60/157,144 
<151> 09/30/99 

25 <160> 20 

<170> Patentln version 3.0 

30 

<210> 1 
35 <211> 705 
<212> PRT 
<213> homo sapiens 

40 

<400> 1 
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Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala 
15 10 15 

5 Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gin 
20 25 30 

Gly Thr Ser Asn Lys Leu Thr Gin Leu Gly Thr Phe Glu Asp His Phe 
35 40 45 

10 

Leu Ser Leu Gin Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn 
50 55 60 

Leu Glu lie Thr Tyr Val Gin Arg Asn Tyr Asp Leu Ser Phe Leu Lys 
15 65 70 75 80 

Thr He Gin Glu Val Ala Gly Tyr Val Leu He Ala Leu Asn Thr Val 
85 90 95 

20 Glu Arg He Pro Leu Glu Asn Leu Gin lie He Arg Gly Asn Met Tyr 
100 105 110 

Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn 
115 120 125 

25 

Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gin Glu He Leu 
130 135 140 

His Gly Ala Val Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu 
30 145 150 155 160 

Ser lie Gin Trp Arg Asp He Val Ser Ser Asp Phe Leu Ser Asn Met 
165 170 175 

35 Ser Met Asp Phe Gin Asn His Leu Gly Ser Cys Gin Lys Cys Asp Pro 
180 185 190 

Ser Cys Pro Asn Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gin 
195 200 205 

40 

Lys Leu Thr Lys He He Cys Ala Gin Gin Cys Ser Gly Arg Cys Arg 
210 215 220 
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Gly Lys Ser Pro Ser Asp Cys Cys His Asn Gin Cys Ala Ala Gly Cys 
225 230 235 240 

Thr Gly Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp 
5 245 250 255 

Glu Ala Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro 
260 265 270 

10 Thr Thr Tyr Gin Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly 
275 280 285 

Ala Thr Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His 
290 295 300 

15 

Gly Ser Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu 
305 310 315 320 

Asp Gly Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val 
20 325 330 335 

Cys Asn Gly He Gly lie Gly Glu Phe Lys Asp Ser Leu Ser lie Asn 
340 345 350 

25 Ala Thr Asn lie Lys His Phe Lys Asn Cys Thr Ser He Ser Gly Asp 
355 360 365 

Leu His lie Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr 
370 375 380 

30 

Pro Pro Leu Asp Pro Gin Glu Leu Asp He Leu Lys Thr Val Lys Glu 
385 390 395 400 

He Thr Gly Phe Leu Leu lie Gin Ala Trp Pro Glu Asn Arg Thr Asp 
35 405 410 415 

Leu His Ala Phe Glu Asn Leu Glu lie He Arg Gly Arg Thr Lys Gin 
420 425 430 

40 His Gly Gin Phe Ser Leu Ala Val Val Ser Leu Asn He Thr Ser Leu 
435 440 445 

Gly Leu Arg Ser Leu Lys Glu He Ser Asp Gly Asp Val He He Ser 
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Gly Asn Lys Asn Leu Cys Tyr Ala Asn Thr lie Asn Tip Lys Lys Leu 
465 470 475 480 

5 

Phe Gly Thr Ser Gly Gin Lys Thr Lys He lie Ser Asn Arg Gly Glu 
485 490 495 

Asn Ser Cys Lys Ala Thr Gly Gin Val Cys His Ala Leu Cys Ser Pro 
10 500 505 510 

Glu Gly Cys Trp Gly Pro Glu Pro Arg Asp Cys Val Ser Cys Arg Asn 
515 520 525 

1 5 Val Ser Arg Gly Arg Glu Cys Val Asp Lys Cys Lys Leu Leu Glu Gly 
530 535 540 

Glu Pro Arg Glu Phe Val Glu Asn Ser Glu Cys lie Gin Cys His Pro 
545 550 555 560 

20 

Glu Cys Leu Pro Gin Ala Met Asn He Thr Cys Thr Gly Arg Gly Pro 
565 570 575 

Asp Asn Cys lie Gin Cys Ala His Tyr He Asp Gly Pro His Cys Val 
25 580 585 590 

Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp 
595 600 605 

30 Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys His Pro Asn Cys 
610 615 620 

Thr Tyr Gly Pro Gly Asn Glu Ser Leu Lys Ala Met Leu Phe Cys Leu 
625 630 635 640 

35 

Phe Lys Leu Ser Ser Cys Asn Gin Ser Asn Asp Gly Ser Val Ser His 
645 650 655 

Gin Ser Gly Ser Pro Ala Ala Gin Glu Ser Cys Leu Gly Trp He Pro 
40 660 665 670 

Ser Leu Leu Pro Ser Glu Phe Gin Leu Gly Trp Gly Gly Cys Ser His 
675 680 685 
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Leu His Ala Tip Pro Ser Ala Ser Val He lie Thr Ala Ser Ser Cys 
690 695 700 

5 His 
705 

<210> 2 
10 <211> 2850 
<212> DNA 
<213> homo sapiens 



<400> 2 

cccggcgcag cgcggccgca gcagcctccg ccccccgcac ggtgtgagcg cccgccgcgg 60 

20 

ccgaggcggc cggagtcccg agctagcccc ggcggccgcc gccgcccaga ccggacgaca 120 
ggccacctcg tcggcgtccg cccgagtccc cgcctcgccg ccaacgccac aaccaccgcg 1 80 
25 cacggccccc tgactccgtc cagtattgat cgggagagcc ggagcgagct cttcggggag 240 
cagcgatgcg accctccggg acggccgggg cagcgctcct ggcgctgctg gctgcgctct 300 
gcccggcgag tcgggctctg gaggaaaaga aagtttgcca aggcacgagt aacaagctca 360 

30 

cgcagttggg cacttttgaa gatcattttc tcagcctcca gaggatgttc aataactgtg 420 
aggtggtcct tgggaatttg gaaattacct atgtgcagag gaattatgat ctttccttct 480 
35 taaagaccat ccaggaggtg gctggttatg tcctcattgc cctcaacaca gtggagcgaa 540 
ttcctttgga aaacctgcag atcatcagag gaaatatgta ctacgaaaat tcctatgcct 600 
tagcagtctt atctaactat gatgcaaata aaaccggact gaaggagctg cccatgagaa 660 

40 

atttacagga aatcctgcat ggcgccgtgc ggttcagcaa caaccatgcc ctgtgcaatg 720 
tggagagcat ccagtggcgg gacatagtca gcagtgactt tctcagcaac atgtcgatgg 780 
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acttccagaa ccacctgggc agctgccaaa agtgtgatcc aagctgtccc aatgggagct 840 
gctggggtgc aggagaggag aactgccaga aactgaccaa aatcatctgt gcccagcagt 900 

5 

gctccgggcg ctgccgtggc aagtccccca gtgactgctg ccacaaccag tgtgctgcag 960 
gctgcacagg cccccgggag agcgactgcc tggtctgccg caaattccga gacgaagcca 1020 
10 cgtgcaagga cacctgcccc ccactcatgc tctacaaccc caccacgtac cagatggatg 1080 
tgaaccccga gggcaaatac agctttggtg ccacctgcgt gaagaagtgt ccccgtaatt 1 140 
atgtggtgac agatcacggc tcgtgcgtcc gagcctgtgg ggccgacagc tatgagatgg 1200 

15 

aggaagacgg cgtccgcaag tgtaagaagt gcgaagggcc ttgccgcaaa gtgtgtaacg 1260 
gaataggtat tggtgaattt aaagactcac tctccataaa tgctacgaat attaaacact 1 320 
20 tcaaaaactg cacctccatc agtggcgatc tccacatcct gccggtggca tttaggggtg 1380 
actccttcac acatactcct cctctggatc cacaggaact ggatattctg aaaaccgtaa 1440 
aggaaatcac agggtttttg ctgattcagg cttggcctga aaacaggacg gacctccatg 1500 

25 

cctttgagaa cctagaaatc atacgcggca ggaccaagca acatggtcag ttttctcttg 1 560 
cagtcgtcag cctgaacata acatccttgg gattacgctc cctcaaggag ataagtgatg 1620 
30 gagatgtgat aatttcagga aacaaaaatt tgtgctatgc aaatacaata aactggaaaa 1680 
aactgtttgg gacctccggt cagaaaacca aaattataag caacagaggt gaaaacagct 1740 
gcaaggccac aggccaggtc tgccatgcct tgtgctcccc cgagggctgc tggggcccgg 1 800 

35 

agcccaggga ctgcgtctct tgccggaatg tcagccgagg cagggaatgc gtggacaagt 1 860 
gcaagcttct ggagggtgag ccaagggagt ttgtggagaa ctctgagtgc atacagtgcc 1920 
40 acccagagtg cctgcctcag gccatgaaca tcacctgcac aggacgggga ccagacaact 1980 
gtatccagtg tgcccactac attgacggcc cccactgcgt caagacctgc ccggcaggag 2040 
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tcatgggaga aaacaacacc ctggtctgga agtacgcaga cgccggccat gtgtgccacc 2100 
tgtgccatcc aaactgcacc tacgggccag gaaatgagag tctcaaagcc atgttattct 2160 
5 gcctttttaa actatcatcc tgtaatcaaa gtaatgatgg cagcgtgtcc caccagagcg 2220 
ggagcccagc tgctcaggag tcatgcttag gatggatccc ttctcttctg ccgtcagagt 2280 
ttcagctggg ttggggtgga tgcagccacc tccatgcctg gccttctgca tctgtgatca 2340 

10 

tcacggcctc ctcctgccac tgagcctcat gccttcacgt gtctgttccc cccacttttc 2400 
ctttctgcca cccctgcacg tgggccgcca ggttccaaga gtatcctacc catttccttc 2460 
15 cttccactcc ctttgccagt gcctctcacc ccaactagta gctaaccatc acccccagga 2520 
ctgacctctt cctcctcgct gccagatgat tgttcaaagc acagaatttg tcagaaacct 2580 
gcagggacgc catgctgcca gccttctccg taattagcat ggccccagtc catgcttcta 2640 

20 

gccttggttc cttctgcccc tctgtttgaa attctagagc cagctgtggg acaattatct 2700 
gtgtcaaaag ccagatgtga aaacatctca ataacaaact ggctgctttg ttcaatgcta 2760 
25 gaacaacgcc tgtcacagag tagaaactca aaaatatttg ctgagtgaat gaacaaatga 2820 
ataaatgcat aataaataat taaccaccaa 2850 

30 <210> 3 
<211> 641 
<212> PRT 

35 

<213> homo sapiens 
40 <400> 3 

Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala 
15 10 15 
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Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gin 
20 25 30 

5 Gly Thr Ser Asn Lys Leu Thr Gin Leu Gly Thr Phe Glu Asp His Phe 
35 40 45 

Leu Ser Leu Gin Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn 
50 55 60 

10 

Leu Glu He Thr Tyr Val Gin Arg Asn Tyr Asp Leu Ser Phe Leu Lys 
65 70 75 80 

Thr He Gin Glu Val Ala Gly Tyr Val Leu He Ala Leu Asn Thr Val 
15 85 90 95 

Glu Arg He Pro Leu Glu Asn Leu Gin lie lie Arg Gly Asn Met Tyr 
100 105 110 

20 Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn 
115 120 125 

Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gin Glu He Leu 
130 135 140 

25 

His Gly Ala Val Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu 
145 150 155 160 

Ser He Gin Trp Arg Asp He Val Ser Ser Asp Phe Leu Ser Asn Met 
30 165 170 175 

Ser Met Asp Phe Gin Asn His Leu Gly Ser Cys Gin Lys Cys Asp Pro 
180 185 190 

35 Ser Cys Pro Asn Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gin 
195 200 205 

Lys Leu Thr Lys He lie Cys Ala Gin Gin Cys Ser Gly Arg Cys Arg 
210 215 220 

40 

Gly Lys Ser Pro Ser Asp Cys Cys His Asn Gin Cys Ala Ala Gly Cys 
225 230 235 240 
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Thr Gly Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp 
245 250 255 

Glu Ala Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro 

5 260 265 270 

Thr Thr Tyr Gin Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly 
275 280 285 

1 0 Ala Thr Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His 
290 295 300 

Gly Ser Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu 
305 310 315 320 

15 

Asp Gly Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val 
325 330 335 

Cys Asn Gly lie Gly He Gly Glu Phe Lys Asp Ser Leu Ser He Asn 
20 340 345 350 

Ala Thr Asn He Lys His Phe Lys Asn Cys Thr Ser lie Ser Gly Asp 
355 360 365 

25 Leu His He Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr 
370 375 380 

Pro Pro Leu Asp Pro Gin Glu Leu Asp He Leu Lys Thr Val Lys Glu 
385 390 395 400 

30 

He Thr Gly Phe Leu Leu He Gin Ala Trp Pro Glu Asn Arg Thr Asp 
405 410 415 

Leu His Ala Phe Glu Asn Leu Glu He He Arg Gly Arg Thr Lys Gin 
35 420 425 430 

His Gly Gin Phe Ser Leu Ala Val Val Ser Leu Asn He Thr Ser Leu 
435 440 445 

40 Gly Leu Arg Ser Leu Lys Glu He Ser Asp Gly Asp Val He He Ser 
450 455 460 

Gly Asn Lys Asn Leu Cys Tyr Ala Asn Thr He Asn Trp Lys Lys Leu 
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465 470 475 480 

Phe Gly Thr Ser Gly Gin Lys Thr Lys lie He Ser Asn Arg Gly Glu 
485 490 495 

5 

Asn Ser Cys Lys Ala Thr Gly Gin Val Cys His Ala Leu Cys Ser Pro 
500 505 510 

Glu Gly Cys Tip Gly Pro Glu Pro Arg Asp Cys Val Ser Cys Arg Asn 
10 515 520 525 

Val Ser Arg Gly Arg Glu Cys Val Asp Lys Cys Lys Leu Leu Glu Gly 
530 535 540 

1 5 Glu Pro Arg Glu Phe Val Glu Asn Ser Glu Cys He Gin Cys His Pro 
545 550 555 560 

Glu Cys Leu Pro Gin Ala Met Asn lie Thr Cys Thr Gly Arg Gly Pro 
565 570 575 

20 

Asp Asn Cys He Gin Cys Ala His Tyr lie Asp Gly Pro His Cys Val 
580 585 590 

Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp 
25 595 600 605 

Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys His Pro Asn Cys 
610 615 620 

30 Thr Tyr Gly Pro Gly Asn Glu Ser Leu Lys Ala Met Leu Phe Cys Leu 
625 630 635 640 

Phe 

35 

<210> 4 
<211> 705 
40 <212> PRT 

<213> homo sapiens 
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<400> 4 



5 Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala 
15 10 15 

Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gin 
20 25 30 

10 

Gly Thr Ser Asn Lys Leu Thr Gin Leu Gly Thr Phe Glu Asp His Phe 
35 40 45 

Leu Ser Leu Gin Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn 
15 50 55 60 

Leu Glu He Thr Tyr Val Gin Arg Asn Tyr Asp Leu Ser Phe Leu Lys 
65 70 75 80 

20 Thr He Gin Glu Val Ala Gly Tyr Val Leu He Ala Leu Asn Thr Val 
85 90 95 

Glu Arg He Pro Leu Glu Asn Leu Gin He He Arg Gly Asn Met Tyr 
100 105 110 

25 

Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn 
115 120 125 

Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gin Glu lie Leu 
30 130 135 140 

His Gly Ala Val Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu 
145 150 155 160 

35 Ser He Gin Trp Arg Asp He Val Ser Ser Asp Phe Leu Ser Asn Met 
165 170 175 

Ser Met Asp Phe Gin Asn His Leu Gly Ser Cys Gin Lys Cys Asp Pro 
180 185 190 

40 

Ser Cys Pro Asn Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gin 
195 200 205 
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Lys Leu Thr Lys lie He Cys Ala Gin Gin Cys Ser Gly Arg Cys Arg 
210 215 220 

Gly Lys Ser Pro Ser Asp Cys Cys His Asn Gin Cys Ala Ala Gly Cys 
5 225 230 235 240 

Thr Gly Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp 
245 250 255 

1 0 Glu Ala Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro 
260 265 270 

Thr Thr Tyr Gin Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly 
275 280 285 

15 

Ala Thr Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His 
290 295 300 

Gly Ser Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu 
20 305 310 315 320 

Asp Gly Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val 
325 330 335 

25 Cys Asn Gly He Gly He Gly Glu Phe Lys Asp Ser Leu Ser He Asn 
340 345 350 

Ala Thr Asn He Lys His Phe Lys Asn Cys Thr Ser lie Ser Gly Asp 
355 360 365 

30 

Leu His He Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr 
370 375 380 

Pro Pro Leu Asp Pro Gin Glu Leu Asp lie Leu Lys Thr Val Lys Glu 
35 385 390 395 400 

He Thr Gly Phe Leu Leu He Gin Ala Trp Pro Glu Asn Arg Thr Asp 
405 410 415 

40 Leu His Ala Phe Glu Asn Leu Glu He lie Arg Gly Arg Thr Lys Gin 
420 425 430 

His Gly Gin Phe Ser Leu Ala Val Val Ser Leu Asn He Thr Ser Leu 
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435 440 445 

Gly Leu Arg Ser Leu Lys Glu lie Ser Asp Gly Asp Val lie He Ser 
450 455 460 

5 

Gly Asn Lys Asn Leu Cys Tyr Ala Asn Thr He Asn Trp Lys Lys Leu 
465 470 475 480 

Phe Gly Thr Ser Gly Gin Lys Thr Lys He He Ser Asn Arg Gly Glu 
10 485 490 495 

Asn Ser Cys Lys Ala Thr Gly Gin Val Cys His Ala Leu Cys Ser Pro 
500 505 510 

1 5 Glu Gly Cys Trp Gly Pro Glu Pro Arg Asp Cys Val Ser Cys Arg Asn 
515 520 525 

Val Ser Arg Gly Arg Glu Cys Val Asp Lys Cys Lys Leu Leu Glu Gly 
530 535 540 

20 

Glu Pro Arg Glu Phe Val Glu Asn Ser Glu Cys He Gin Cys His Pro 
545 550 555 560 

Glu Cys Leu Pro Gin Ala Met Asn He Thr Cys Thr Gly Arg Gly Pro 
25 565 570 575 

Asp Asn Cys He Gin Cys Ala His Tyr He Asp Gly Pro His Cys Val 
580 585 590 

30 Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp 
595 600 605 

Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys His Pro Asn Cys 
610 615 620 

35 

Thr Tyr Gly Pro Gly Asn Glu Ser Leu Lys Ala Met Leu Phe Cys Leu 
625 630 635 640 

Phe Lys Leu Ser Ser Cys Asn Gin Ser Asn Asp Gly Ser Val Ser His 
40 645 650 655 

Arg Ser Gly Ser Pro Ala Ala Gin Glu Ser Cys Leu Gly Trp He Pro 
660 665 670 
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Ser Leu Leu Pro Ser Glu Phe Gin Leu Gly Trp Gly Gly Cys Ser His 
675 680 685 

5 Leu His Ala Trp Pro Ser Ala Ser Val He He Thr Ala Ser Ser Cys 
690 695 700 

His 
705 

10 

<210> 5 
<211> 705 
15 <212> PRT 

<213> homo sapiens 



20 

<400> 5 

Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala 
15 10 15 

25 

Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gin 
20 25 30 

Gly Thr Ser Asn Lys Leu Thr Gin Leu Gly Thr Phe Glu Asp His Phe 
30 35 40 45 

Leu Ser Leu Gin Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn 
50 55 60 

35 Leu Glu He Thr Tyr Val Gin Arg Asn Tyr Asp Leu Ser Phe Leu Lys 
65 70 75 80 

Thr He Gin Glu Val Ala Gly Tyr Val Leu He Ala Leu Asn Thr Val 
85 90 95 

40 

Glu Arg lie Pro Leu Glu Asn Leu Gin He He Arg Gly Asn Met Tyr 
100 105 110 
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Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn 
115 120 125 

Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gin Glu He Leu 
5 130 135 140 

His Gly Ala Val Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu 
145 150 155 160 

10 Ser He Gin Tip Arg Asp He Val Ser Ser Asp Phe Leu Ser Asn Met 
165 170 175 

Ser Met Asp Phe Gin Asn His Leu Gly Ser Cys Gin Lys Cys Asp Pro 
180 185 190 

15 

Ser Cys Pro Asn Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gin 
195 200 205 

Lys Leu Thr Lys He He Cys Ala Gin Gin Cys Ser Gly Arg Cys Arg 
20 210 215 220 

Gly Lys Ser Pro Ser Asp Cys Cys His Asn Gin Cys Ala Ala Gly Cys 
225 230 235 240 

25 Thr Gly Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp 
245 250 255 

Glu Ala Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro 
260 265 270 

30 

Thr Thr Tyr Gin Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly 
275 280 285 

Ala Thr Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His 
35 290 295 300 

Gly Ser Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu 
305 310 315 320 

40 Asp Gly Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val 
325 330 335 

Cys Asn Gly He Gly lie Gly Glu Phe Lys Asp Ser Leu Ser He Asn 
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340 345 350 

Ala Thr Asn lie Lys His Phe Lys Asn Cys Thr Ser He Ser Gly Asp 
355 360 365 

5 

Leu His He Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr 
370 375 380 

Pro Pro Leu Asp Pro Gin Glu Leu Asp He Leu Lys Thr Val Lys Glu 
10 385 390 395 400 

He Thr Gly Phe Leu Leu lie Gin Ala Trp Pro Glu Asn Arg Thr Asp 
405 410 415 

1 5 Leu His Ala Phe Glu Asn Leu Glu He He Arg Gly Arg Thr Lys Gin 
420 425 430 

His Gly Gin Phe Ser Leu Ala Val Val Ser Leu Asn lie Thr Ser Leu 
435 440 445 

20 

Gly Leu Arg Ser Leu Lys Glu He Ser Asp Gly Asp Val He lie Ser 
450 455 460 

Gly Asn Lys Asn Leu Cys Tyr Ala Asn Thr He Asn Trp Lys Lys Leu 
25 465 470 475 480 

Phe Gly Thr Ser Gly Gin Lys Thr Lys He He Ser Asn Arg Gly Glu 
485 490 495 

30 Asn Ser Cys Lys Ala Thr Gly Gin Val Cys His Ala Leu Cys Ser Pro 
500 505 510 

Glu Gly Cys Trp Gly Pro Glu Pro Arg Asp Cys Val Ser Cys Arg Asn 
515 520 525 

35 

Val Ser Arg Gly Arg Glu Cys Val Asp Lys Cys Lys Leu Leu Glu Gly 
530 535 540 

Glu Pro Arg Glu Phe Val Glu Asn Ser Glu Cys He Gin Cys His Pro 
40 545 550 555 560 

Glu Cys Leu Pro Gin Ala Met Asn lie Thr Cys Thr Gly Arg Gly Pro 
565 570 575 
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Asp Asn Cys He Gin Cys Ala His Tyr He Asp Gly Pro His Cys Val 
580 585 590 

5 Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp 
595 600 605 

Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys His Pro Asn Cys 
610 615 620 

10 

Thr Tyr Gly Pro Gly Asn Glu Ser Leu Lys Ala Met Leu Phe Cys Leu 
625 630 635 640 

Phe Lys Leu Ser Ser Cys Asn Gin Ser Asn Asp Gly Ser Val Ser His 
15 645 650 655 

Gin Ser Gly Ser Leu Ala Ala Gin Glu Ser Cys Leu Gly Trp He Pro 
660 665 670 

20 Ser Leu Leu Pro Ser Glu Phe Gin Leu Gly Trp Gly Gly Cys Ser His 
675 680 685 

Leu His Ala Trp Pro Ser Ala Ser Val He He Thr Ala Ser Ser Cys 
690 695 700 

25 

His 
705 

<210> 6 

30 

<211> 705 
<212> PRT 
35 <213> homo sapiens 



<400> 6 

40 

Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala 
15 10 15 
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Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gin 
20 25 30 

Gly Thr Ser Asn Lys Leu Thr Gin Leu Gly Thr Phe Glu Asp His Phe 
5 35 40 45 

Leu Ser Leu Gin Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn 
50 55 60 

1 0 Leu Glu He Thr Tyr Val Gin Arg Asn Tyr Asp Leu Ser Phe Leu Lys 
65 70 75 80 

Thr He Gin Glu Val Ala Gly Tyr Val Leu He Ala Leu Asn Thr Val 
85 90 95 

Glu Arg He Pro Leu Glu Asn Leu Gin He He Arg Gly Asn Met Tyr 
100 105 110 

Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn 
115 120 125 

Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gin Glu He Leu 
130 135 140 

His Gly Ala Val Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu 
145 150 155 160 

Ser He Gin Trp Arg Asp lie Val Ser Ser Asp Phe Leu Ser Asn Met 
165 170 175 

30 

Ser Met Asp Phe Gin Asn His Leu Gly Ser Cys Gin Lys Cys Asp Pro 
180 185 190 

Ser Cys Pro Asn Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gin 
35 195 200 205 

Lys Leu Thr Lys lie He Cys Ala Gin Gin Cys Ser Gly Arg Cys Arg 
210 215 220 

40 Gly Lys Ser Pro Ser Asp Cys Cys His Asn Gin Cys Ala Ala Gly Cys 
225 230 235 240 

Thr Gly Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp 
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245 250 255 

Glu Ala Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro 
260 265 270 

5 

Thr Thr Tyr Gin Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly 
275 280 285 

Ala Thr Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His 
10 290 295 300 

Gly Ser Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu 
305 310 315 320 

1 5 Asp Gly Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val 
325 330 335 

Cys Asn Gly He Gly He Gly Glu Phe Lys Asp Ser Leu Ser He Asn 
340 345 350 

20 

Ala Thr Asn He Lys His Phe Lys Asn Cys Thr Ser He Ser Gly Asp 
355 360 365 

Leu His He Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr 
25 370 375 380 

Pro Pro Leu Asp Pro Gin Glu Leu Asp He Leu Lys Thr Val Lys Glu 
385 390 395 400 

30 lie Thr Gly Phe Leu Leu He Gin Ala Trp Pro Glu Asn Arg Thr Asp 
405 410 415 

Leu His Ala Phe Glu Asn Leu Glu He He Arg Gly Arg Thr Lys Gin 
420 425 430 

35 

His Gly Gin Phe Ser Leu Ala Val Val Ser Leu Asn lie Thr Ser Leu 
435 440 445 

Gly Leu Arg Ser Leu Lys Glu lie Ser Asp Gly Asp Val He He Ser 
40 450 455 460 

Gly Asn Lys Asn Leu Cys Tyr Ala Asn Thr He Asn Trp Lys Lys Leu 
465 470 475 480 
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Phe Gly Thr Ser Gly Gin Lys Thr Lys He lie Ser Asn Arg Gly Glu 
485 490 495 

5 Asn Ser Cys Lys Ala Thr Gly Gin Val Cys His Ala Leu Cys Ser Pro 
500 505 510 

Glu Gly Cys Trp Gly Pro Glu Pro Arg Asp Cys Val Ser Cys Arg Asn 
515 520 525 

10 

Val Ser Arg Gly Arg Glu Cys Val Asp Lys Cys Lys Leu Leu Glu Gly 
530 535 540 

Glu Pro Arg Glu Phe Val Glu Asn Ser Glu Cys lie Gin Cys His Pro 
15 545 550 555 560 

Glu Cys Leu Pro Gin Ala Met Asn He Thr Cys Thr Gly Arg Gly Pro 
565 570 575 

20 Asp Asn Cys lie Gin Cys Ala His Tyr He Asp Gly Pro His Cys Val 
580 585 590 

Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp 
595 600 605 

25 

Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys His Pro Asn Cys 
610 615 620 

Thr Tyr Gly Pro Gly Asn Glu Ser Leu Lys Ala Met Leu Phe Cys Leu 
30 625 630 635 640 

Phe Lys Leu Ser Ser Cys Asn Gin Ser Asn Asp Gly Ser Val Ser His 
645 650 655 

35 Gin Ser Gly Ser Pro Ala Ala Gin Glu Ser Cys Leu Gly Trp He Pro 
660 665 670 

Ser Leu Leu Pro Ser Glu Phe Gin Leu Gly Trp Gly Gly Cys Ser His 
675 680 685 

40 

Leu His Ala Trp Pro Ser Ala Ser Val He He Thr Ala Ser Phe Cys 
690 695 700 
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<400> 7 
a 15 tcggggagca gcgatgcgac 

m 

S! <210> 8 

01 

W 20 <211> 20 

m 



o 

m 25 

o 
□ 

30 



35 



40 



<212> DNA 
<213> synthetic 



<400> 8 

ccattcgttg gacagccttc 

<210> 9 
<211> 21 
<212> DNA 
<213> synthetic 



<400> 9 

gtcatgggag aaaacaacac c 



20 



20 



21 
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<210> 10 
5 <211> 20 
<212> DNA 
<213> synthetic 

<400> 10 

agtaatgatg gcagcgtgtc 

<210> 11 
<211> 20 
<212> DNA 
<213> synthetic 



<400> 11 

tcgctgccag atgattgttc 

<210> 12 
<211> 20 
35 <212> DNA 
<213> synthetic 



20 



20 



40 



<400> 12 

cgctctggtg ggacacgctg 
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<400> 13 

gaaggaacca aggctagaag 20 

0 15 

iO <210> 14 

i 

SI <211> 17 

W 20 <212> PRT 

i 

u <213> synthetic 

O 

m 

m 25 

yl <400> 14 

o 

O Leu Glu Asn Leu Gin He lie Arg Gly Asn Met Tyr Tyr Glu Asn Ser 
15 10 15 

30 

Tyr 



<210> 15 

35 

<211> 23 
<212> PRT 
40 <213> synthetic 
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<400> 15 

Asp Ser Tyr Glu Asn lie Glu Glu Asp Gly Val Arg Lys Cys Lys Lys 
15 10 15 

5 

Cys Glu Gly Pro Cys Arg Lys 
20 

<210> 16 

10 

<211> 18 
<212> PRT 
15 <213> synthetic 



H! <400> 16 

S Phe Arg Gly Asp Ser Phe Thr His Thr Pro Pro Leu Asp Pro Gin Glu 

^ 1 5 10 15 

■n Leu Asp 

ffi 25 

□ <210> 17 

<211> 12 

30 

<212> PRT 
<213> synthetic 

35 

<400> 17 

He Gin Cys Ala His Tyr He Asp Gly Pro His Cys 
40 1 5 10 

<210> 18 
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<211> 31 
<212> PRT 
5 <213> synthetic 



<400> 18 

10 

Ser Val Ser His Gin Ser Gly Ser Pro Ala Ala Gin Glu Ser Cys Leu 
15 10 15 

Gly Trp He Pro Ser Leu Leu Pro Ser Glu Phe Gin Leu Gly Trp 
15 20 25 30 

<210> 19 

<211> 24 

20 

<212> PRT 
<213> synthetic 

25 

<400> 19 

Gly Gly Cys Ser His Leu His Leu His Ala Trp Pro Ser Ala Ser Val 
30 1 5 10 15 

He He Thr Ala Ser Ser Cys His 
20 

35 <210> 20 
<211> 34 
<212> PRT 

40 

<213> synthetic 
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Ala Asn Cys Thr Tyr Gly Cys Ala Gly Pro Gly Leu Gin Gly Cys Glu 
15 10 15 

Val Trp Pro Ser Gly Tyr Val Glu Trp Gin Trp He Leu Lys Thr Phe 
20 25 30 

Trp lie 
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WE CLAIM: 



1 . An isolated nucleic acid selected form the group consisting of: 

a) a nucleic acid which encodes a protein comprising the amino acid sequence SEQ ID 
5 NO. 1, 

b) a nucleic acid which encodes a protein comprising an amino acid sequence which is at 

least 90% identical to SEQ ID NO. 1 and which has at least 50% of the biological 
activity of the protein SEQ ID NO. 1, 

c) a nucleic acid which is complementary to nucleic acid a) or b). 

10 2. The isolated nucleic acid of claim 1 wherein the nucleic acid has the sequence SEQ ID NO. 
2. 

3. The isolated nucleic acid of claim 1 wherein the nucleic acid encodes a protein comprising 
the amino acid sequence SEQ ID NO. 3. 

4. The isolated nucleic acid of claim 1 wherein the nucleic acid encodes a protein comprising an 
15 amino acid sequence which is at least 99% identical to SEQ ID NO. 1. 

5. The isolated nucleic acid of claim 4 wherein the encoded protein comprises an amino acid 
sequence selected from the group consisting of SEQ ID NO. 4, SEQ ID NO. 5, and SEQ ID 
NO. 6. 

6. An immunogenic conjugate comprising an immunogenic carrier molecule and a polypeptide 
20 of between 10 and 500 amino acids in length comprising an amino acid sequence of 10 to 25 

amino acids in length which is identical to an amino acid sequence of the same length 
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contained in an amino acid sequence selected from the group consisting of amino acids 628- 
705. of SEQ ID NO. 1, amino acids 628-705 of SEQ ID NO. 4, amino acids 628-705 of SEQ 
ID NO. 5, and amino acids 628-705 of SEQ ID NO. 6. 

The immunogenic conjugate of claim 6, wherein the polypeptide comprises an amino acid 
sequence of 1 1 to 21 amino acids in length which is identical to an amino acid sequence of 
the same length contained in an amino acid sequence selected from the group consisting of 
amino acids 628-705 of SEQ ID NO. 1, amino acids 628-705 of SEQ ID NO. 4, amino acids 
628-705 of SEQ ID NO. 5, and amino acids 628-705 of SEQ ID NO. 6. 

The immunogenic conjugate of claim 6, wherein the immunogenic carrier molecule is 
selected from the group consisting of keyhole limpet hemocyanin and bovine serum albumin. 

An assay for determining the concentration of soluble epidermal growth factor receptor in a 
biological sample from a human patient, the assay comprising: 

a) obtaining a biological sample from the patient; 

b) contacting an amount of a first purified antibody that specifically reacts with a first 
epitope of the extracellular ligand binding domain of sErbBl with the patient biological 
sample to be tested, wherein the first purified antibody is modified with a first labeling 
moiety; 

c) contacting the sample with an amount of a second purified antibody that specifically 
reacts with a second epitope of the extracellular ligand binding domain of sErbBl, 
wherein the second purified antibody is modified with a second labeling moiety, and 
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wherein the second purified antibody does not competitively inhibit the binding of the 
first purified antibody; and 

d) detecting the co-presence of the first and second labels to determine the concentration 
of the soluble epidermal growth factor receptor complexed with the antibodies; 

5 wherein one of the antibodies is chosen from the group consisting of: MAb R.1 and 

antibodies which competitively inhibit the binding of MAb R.1 to ErbBl; and wherein 
the other antibody is chosen from the group consisting of MAb 528 and antibodies which 
competitively inhibit the binding of MAb 528 to ErbBl. 

10. The assay of claim 9 wherein the patient biological sample is chosen from the group 
10 consisting of urine and ascites. 

1 1 . The assay of claim 1 1 wherein the patient biological sample is chosen from the group 
consisting of blood, serum and plasma. 

12. The assay of claim 1 1 wherein the first labeling moiety is an affinity binding moiety. 

13. The assay of claim 12 wherein the affinity binding moiety is biotin. 

15 14. The assay of claim 13 wherein detection of the presence of the first labeling moiety is by 

binding of the biotin moiety to a solid support coated with a molecule chosen from the group 
consisting of streptavidin and avidin. 

15. The assay of claim 9 wherein the second labelling moiety is selected from the group 
consisting of a fluorescent moiety, a colorigenic moiety, and a chemiluminescent moiety. 

20 16. The assay of claim 9 wherein the second labelling moiety is acridinium. 
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17. The assay of claim 16 wherein the detection of the presence of the second labeling moiety is 
by measuring light emitted from a chemiluminescent reaction utilizing the second labeling 
moiety. 

18. The assay of claim 9 wherein the patient is female, further comprising the steps of; 

5 e) comparing the concentration of soluble epidermal growth factor receptor obtained in 

step d) with a normal value; and 

f) correlating a decrease in the concentration of soluble epidermal growth factor receptor 
in the patient biological sample with the presence of an ovarian carcinoma in the patient. 

19. The assay of claim 18 wherein the normal value is obtained by assaying biological samples 
10 from females of approximately the same age as the patient. 

20. The assay of claim 18 further comprising the step of performing a second assay on a 
biological sample obtained from the patient at a point in time after the initial assay. 

21. The assay of claim 20, wherein the patient has undergone treatment for ovarian cancer 
selected from the group consisting of chemotherapy, radiation therapy, and surgical treatment 

15 in the interval between the initial and second assay. 

22. The assay of claim 20, further comprising the step of correlating an increase in the 
concentration of soluble epidermal growth factor receptor in the patient biological sample 
with an improved prognosis in the ovarian cancer condition. 

23. The assay of claim 20, further comprising the step of correlating a decrease in the 

20 concentration of soluble epidermal growth factor receptor in the patient biological sample 
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with an declining prognosis in the ovarian cancer condition. 
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ABSTRACT 



The present invention relates to the discovery of soluble isoforms of an Epidermal 
Growth Factor Receptor, or sErbBl variants, the provision of the sequences of nucleic acids 
encoding these isoforms, purified recombinant proteins, novel antibodies specific for these 
isoforms, and the use of immunoassay and other protein assay techniques to measure the 
concentration of these protein isoforms in a patient biological sample in the femtomolar range. 
The present invention also provides methods for determining the presence of an ovarian 
carcinoma in the patient by assaying the concentration of soluble ErbBl variants in a biological 
sample from a patient. 
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Serum sErbB 1 levels in healthy women, patients with EOC, benign ovarian * 
tumors, and other benign gynecological diseases as measured by ALISA and compared. 
Serum samples with sErbBl levels below the inter-assay biological detection limit 
(horizontal line with arrow) of 5.89 fmol/ml were arbitrarily assigned values of 5.0 
fmol/ml for graphing purposes. Each data point represents the median of the mean sErbB 1 
concentration for one serum sample tested in duplicate from a minimum of three separate 
assays. The median sErbBl concentration for each group of patients is indicated by the 
horizontal line. 
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SEQUENCE LISTING 
<110> Maihle, Nita Reiter, Jill Baron, Andre 

<120> SOLUBLE EPIDERMAL GROWTH FACTOR RECEPTOR- LIKE PROTEINS AND THEIR USES IN 
CANCER DETECTION METHODS 

<130> Atairgin 257/225 

<150> 60/157,144 

<151> 09/30/99 

<160> 20 <170> Patentln version 3.0<210> 1<211> 705<212> PRT<213> homo 
sapiens<400> 1 

Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala 
15 10 15 

Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gin 
20 25 30 

Gly Thr Ser Asn Lys Leu Thr Gin Leu Gly Thr Phe Glu Asp His Phe 
35 40 45 

Leu Ser Leu Gin Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn 
50 55 60 

Leu Glu He Thr Tyr Val Gin Arg Asn Tyr Asp Leu Ser Phe Leu Lys 
65 70 75 80 

Thr He Gin Glu Val Ala Gly Tyr Val Leu He Ala Leu Asn Thr Val 
85 90 95 

Glu Arg He Pro Leu Glu Asn Leu Gin He He Arg Gly Asn Met Tyr 
100 105 HO 

Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn 
115 120 125 

Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gin Glu He Leu 
130 135 140 

His Gly Ala Val Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu 
145 150 155 160 

Ser He Gin Trp Arg Asp He Val Ser Ser Asp Phe Leu Ser Asn Met 
165 170 175 

Ser Met Asp Phe Gin Asn His Leu Gly Ser Cys Gin Lys Cys Asp Pro 
180 185 190 

Ser Cys Pro Asn Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gin 
195 200 205 

Lys Leu Thr Lys He He Cys Ala Gin Gin Cys Ser Gly Arg Cys Arg 
210 215 220 



Gly Lys Ser Pro Ser Asp Cys Cys His Asn Gin Cys Ala Ala Gly Cys 
225 230 235 240 

Thr Gly Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp 
245 250 255 

Glu Ala Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro 
260 265 270 

Thr Thr Tyr Gin Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly 
275 280 285 

Ala Thr Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His 
290 295 300 

Gly Ser Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu 
305 310 315 320 

Asp Gly Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val 
325 330 335 

Cys Asn Gly He Gly He Gly Glu Phe Lys Asp Ser Leu Ser He Asn 
340 345 350 

Ala Thr Asn He Lys His Phe Lys Asn Cys Thr Ser He Ser Gly Asp 
355 360 365 

Leu His He Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr 
370 375 380 

Pro Pro Leu Asp Pro Gin Glu Leu Asp He Leu Lys Thr Val Lys Glu 
385 390 395 400 

He Thr Gly Phe Leu Leu He Gin Ala Trp Pro Glu Asn Arg Thr Asp 
405 410 415 

Leu His Ala Phe Glu Asn Leu Glu He He Arg Gly Arg Thr Lys Gin 
420 425 430 

His Gly Gin Phe Ser Leu Ala Val Val Ser Leu Asn He Thr Ser Leu 
435 440 445 

Gly Leu Arg Ser Leu Lys Glu He Ser Asp Gly Asp Val He He Ser 
450 455 460 

Gly Asn Lys Asn Leu Cys Tyr Ala Asn Thr He Asn Trp Lys Lys Leu 
465 470 475 480 

Phe Gly Thr Ser Gly Gin Lys Thr Lys He He Ser Asn Arg Gly Glu 
485 490 495 

Asn Ser Cys Lys Ala Thr Gly Gin Val Cys His Ala Leu Cys Ser Pro 
500 505 510 

Glu Gly Cys Trp Gly Pro Glu Pro Arg Asp Cys Val Ser Cys Arg Asn 
515 520 525 



Val Ser Arg Gly Arg Glu Cys Val Asp Lys Cys Lys Leu Leu Glu Gly 
530 535 540 

Glu Pro Arg Glu Phe Val Glu Asn Ser Glu Cys lie Gin Cys His Pro 
545 550 555 560 

Glu Cys Leu Pro Gin Ala Met Asn lie Thr Cys Thr Gly Arg Gly Pro 
565 570 575 

Asp Asn Cys lie Gin Cys Ala His Tyr lie Asp Gly Pro His Cys Val 
580 585 590 

Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp 
595 600 605 

Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys His Pro Asn Cys 
610 615 620 

Thr Tyr Gly Pro Gly Asn Glu Ser Leu Lys Ala Met Leu Phe Cys Leu 
625 630 635 640 

Phe Lys Leu Ser Ser Cys Asn Gin Ser Asn Asp Gly Ser Val Ser His 
645 650 655 

Gin Ser Gly Ser Pro Ala Ala Gin Glu Ser Cys Leu Gly Trp lie Pro 
660 665 670 

Ser Leu Leu Pro Ser Glu Phe Gin Leu *Gly Trp Gly Gly Cys Ser His 



Leu His Ala Trp Pro Ser Ala Ser Val lie lie Thr Ala Ser Ser Cys 
690 695 700 

His 
705 

<210> 2<211> 2850<212> DNA<213> homo sapiens<400> 2 

cccggcgcag cgcggccgca gcagcctccg ccccccgcac ggtgtgagcg cccgccgcgg 60 

ccgaggcggc cggagtcccg agctagcccc ggcggccgcc gccgcccaga ccggacgaca 12 0 

ggccacctcg tcggcgtccg cccgagtccc cgcctcgccg ccaacgccac aaccaccgcg 180 

cacggccccc tgactccgtc cagtattgat cgggagagcc ggagcgagct cttcggggag 24 0 

cagcgatgcg accctccggg acggccgggg cagcgctcct ggcgctgctg gctgcgctct 300 

gcccggcgag tcgggctctg gaggaaaaga aagtttgcca aggcacgagt aacaagctca 360 

cgcagttggg cacttttgaa gatcattttc tcagcctcca gaggatgttc aataactgtg 420 

aggtggtcct tgggaatttg gaaattacct atgtgcagag gaattatgat ctttccttct 480 

taaagaccat ccaggaggtg gctggttatg tcctcattgc cctcaacaca gtggagcgaa 54 0 

ttcctttgga aaacctgcag atcatcagag gaaatatgta ctacgaaaat tcctatgcct 600 



tagcagtctt 


atctaactat 


gatgcaaata 


aaaccggact 


gaaggagctg 


cccatgagaa 


660 


atttacagga 


aatcctgcat 


ggcgccgtgc 


ggttcagcaa 


caaccatgcc 


ctgtgcaatg 


720 


tggagagcat 


ccagtggcgg 


gacatagtca 


gcagtgactt 


tctcagcaac 


atgtcgatgg 


780 


acttccagaa 


ccacctgggc 


agctgccaaa 


agtgtgatcc 


aagctgtccc 


aatgggagct 


840 


gctggggtgc 


aggagaggag 


aactgccaga 


aactgaccaa 


aatcatctgt 


gcccagcagt 


900 


gctccgggcg 


ctgccgtggc 


aagtccccca 


gtgactgctg 


ccacaaccag 


tgtgctgcag 


960 


gctgcacagg 


cccccgggag 


agcgactgcc 


tggtctgccg 


caaattccga 


gacgaagcca 


1020 


cgtgcaagga 


cacctgcccc 


ccactcatgc 


tctacaaccc 


caccacgtac 


cagatggatg 


1080 


tgaaccccga gggcaaatac 


agctttggtg 


ccacctgcgt 


gaagaagtgt 


ccccgtaatt 


1140 


atgtggtgac 


agatcacggc 


tcgtgcgtcc 


gagcctgtgg 


ggccgacagc 


tatgagatgg 


1200 


aggaagacgg 


cgtccgcaag 


tgtaagaagt 


gcgaagggcc 


ttgccgcaaa 


gtgtgtaacg 


1260 


gaataggtat 


tggtgaattt 


aaagactcac 


tctccataaa 


tgctacgaat 


attaaacact 


1320 


tcaaaaactg 


cacctccatc 


agtggcgatc 


tccacatcct 


gccggtggca 


tttaggggtg 


1380 


actccttcac 


acatactcct 


cctctggatc 


cacaggaact 


ggatattctg 


aaaaccgtaa 


1440 


aggaaatcac 


agggtttttg 


ctgattcagg 


cttggcctga 


aaacaggacg 


gacctccatg 


1500 


cctttgagaa 


cctagaaatc 


atacgcggca 


ggaccaagca 


acatggtcag 


ttttctcttg 


1560 


cagtcgtcag 


cctgaacata 


acatccttgg 


gattacgctc 


cctcaaggag 


ataagtgatg 


1620 


gagatgtgat 


aatttcagga 


aacaaaaatt 


tgtgctatgc 


aaatacaata 


aactggaaaa 


1680 


aactgtttgg gacctccggt 


cagaaaacca 


aaattataag 


caacagaggt 


gaaaacagct 


1740 


gcaaggccac 


aggccaggtc 


tgccatgcct 


tgtgctcccc 


cgagggctgc 


tggggcccgg 


1800 


agcccaggga 


ctgcgtctct 


tgccggaatg 


tcagccgagg 


cagggaatgc 


gtggacaagt 


1860 


gcaagcttct 


ggagggtgag 


ccaagggagt 


ttgtggagaa 


ctctgagtgc 


atacagtgcc 


1920 


acccagagtg 


cctgcctcag gccatgaaca 


tcacctgcac 


aggacgggga 


ccagacaact 


1980 


gtatccagtg 


tgcccactac 


attgacggcc 


cccactgcgt 


caagacctgc 


ccggcaggag 


2040 


tcatgggaga 


aaacaacacc 


ctggtctgga 


agtacgcaga 


cgccggccat 


gtgtgccacc 


2100 


tgtgccatcc 


aaactgcacc 


tacgggccag gaaatgagag 


tctcaaagcc 


atgttattct 


2160 


gcctttttaa 


actatcatcc 


tgtaatcaaa 


gtaatgatgg 


cagcgtgtcc 


caccagagcg 


2220 


ggagcccagc 


tgctcaggag 


tcatgcttag 


gatggatccc 


ttctcttctg 


ccgtcagagt 


2280 


ttcagctggg 


ttggggtgga 


tgcagccacc 


tccatgcctg gccttctgca 


tctgtgatca 


2340 



tcacggcctc ctcctgccac tgagcctcat gccttcacgt gtctgttccc cccacttttc 2400 

ctttctgcca cccctgcacg tgggccgcca ggttccaaga gtatcctacc catttccttc 2460 

cttccactcc ctttgccagt gcctctcacc ccaactagta gctaaccatc acccccagga 252 0 

ctgacctctt cctcctcgct gccagatgat tgttcaaagc acagaatttg tcagaaacct 2580 

gcagggacgc catgctgcca gccttctccg taattagcat ggccccagtc catgcttcta 2640 

gccttggttc cttctgcccc tctgtttgaa attctagagc cagctgtggg acaattatct 2700 

gtgtcaaaag ccagatgtga aaacatctca ataacaaact ggctgctttg ttcaatgcta 2760 

gaacaacgcc tgtcacagag tagaaactca aaaatatttg ctgagtgaat gaacaaatga 2 82 0 

ataaatgcat aataaataat taaccaccaa 2850 

<210> 3<211> 641<212> PRT<213> homo sapiens<400> 3 

Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala 
15 10 15 

Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gin 
20 25 30 

Gly Thr Ser Asn Lys Leu Thr Gin Leu Gly Thr Phe Glu Asp His Phe 
35 40 45 

Leu Ser Leu Gin Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn 
50 55 60 

Leu Glu lie Thr Tyr Val Gin Arg Asn Tyr Asp Leu Ser Phe Leu Lys 
65 70 75 80 

Thr He Gin Glu Val Ala Gly Tyr Val Leu He Ala Leu Asn Thr Val 
85 90 95 

Glu Arg He Pro Leu Glu Asn Leu Gin He He Arg Gly Asn Met Tyr 
100 105 110 

Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn 
115 120 125 

Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gin Glu He Leu 
130 135 140 

His Gly Ala Val Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu 
145 150 155 160 

Ser He Gin Trp Arg Asp He Val Ser Ser Asp Phe Leu Ser Asn Met 
165 170 175 

Ser Met Asp Phe Gin Asn His Leu Gly Ser Cys Gin Lys Cys Asp Pro 
180 185 190 



Ser Cys Pro Asn Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gin 
195 200 205 

Lys Leu Thr Lys lie lie Cys Ala Gin Gin Cys Ser Gly Arg Cys Arg 
210 215 220 

Gly Lys Ser Pro Ser Asp Cys Cys His Asn Gin Cys Ala Ala Gly Cys 
225 230 235 240 

Thr Gly Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp 
245 250 255 

Glu Ala Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro 
260 265 270 

Thr Thr Tyr Gin Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly 
275 280 285 

Ala Thr Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His 
290 295 300 

Gly Ser Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu 
305 310 315 320 

Asp Gly Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val 
325 330 335 

Cys Asn Gly lie Gly lie Gly Glu Phe Lys Asp Ser Leu Ser lie Asn 
340 345 350 

Ala Thr Asn He Lys His Phe Lys Asn Cys Thr Ser He Ser Gly Asp 
355 360 365 

Leu His He Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr 
370 375 380 

Pro Pro Leu Asp Pro Gin Glu Leu Asp He Leu Lys Thr Val Lys Glu 
385 390 395 400 

He Thr Gly Phe Leu Leu He Gin Ala Trp Pro Glu Asn Arg Thr Asp 
405 410 415 

Leu His Ala Phe Glu Asn Leu Glu He He Arg Gly Arg Thr Lys Gin 
420 425 430 

His Gly Gin Phe Ser Leu Ala Val Val Ser Leu Asn He Thr Ser Leu 
435 440 445 

Gly Leu Arg Ser Leu Lys Glu He Ser Asp Gly Asp Val He He Ser 
450 455 460 

Gly Asn Lys Asn Leu Cys Tyr Ala Asn Thr He Asn Trp Lys Lys Leu 
465 470 475 480 

Phe Gly Thr Ser Gly Gin Lys Thr Lys He He Ser Asn Arg Gly Glu 
485 490 495 



Asn Ser Cys Lys Ala Thr Gly Gin Val Cys His Ala Leu Cys Ser Pro 
500 505 510 



Glu Gly Cys Trp Gly Pro Glu Pro Arg Asp Cys Val Ser Cys Arg Asn 
515 520 525 

Val Ser Arg Gly Arg Glu Cys Val Asp Lys Cys Lys Leu Leu Glu Gly 
530 535 540 

Glu Pro Arg Glu Phe Val Glu Asn Ser Glu Cys lie Gin Cys His Pro 
545 550 555 560 

Glu Cys Leu Pro Gin Ala Met Asn lie Thr Cys Thr Gly Arg Gly Pro 
565 570 575 

Asp Asn Cys He Gin Cys Ala His Tyr He Asp Gly Pro His Cys Val 
580 585 590 

Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp 
595 600 605 

Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys His Pro Asn Cys 
610 , 615 620 

Thr Tyr Gly Pro Gly Asn Glu Ser Leu Lys Ala Met Leu Phe Cys Leu 
625 630 635 640 

Phe 



<210> 4<211> 705<212> PRT<213> homo sapiens<400> 4 

Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala 
15 10 15 

Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gin 
20 25 30 

Gly Thr Ser Asn Lys Leu Thr Gin Leu Gly Thr Phe Glu Asp His Phe 
35 40 45 

Leu Ser Leu Gin Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn 
50 55 60 

Leu Glu He Thr Tyr Val Gin Arg Asn Tyr Asp Leu Ser Phe Leu Lys 
65 70 75 80 

Thr He Gin Glu Val Ala Gly Tyr Val Leu He Ala Leu Asn Thr Val 
85 90 95 

Glu Arg He Pro Leu Glu Asn Leu Gin He He Arg Gly Asn Met Tyr 
100 105 110 



Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn 
115 120 125 



Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gin Glu lie Leu 
130 135 140 

His Gly Ala Val Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu 
145 150 155 160 

Ser lie Gin Trp Arg Asp lie Val Ser Ser Asp Phe Leu Ser Asn Met 
165 170 175 

Ser Met Asp Phe Gin Asn His Leu Gly Ser Cys Gin Lys Cys Asp Pro 
180 185 190 

Ser Cys Pro Asn Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gin 
195 200 205 

Lys Leu Thr Lys lie He Cys Ala Gin Gin Cys Ser Gly Arg Cys Arg 
210 215 220 

Gly Lys Ser Pro Ser Asp Cys Cys His Asn Gin Cys Ala Ala Gly Cys 
225 230 235 240 

Thr Gly Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp 
245 250 255 

Glu Ala Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro 
260 265 270 

Thr Thr Tyr Gin Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly 
275 280 285 

Ala Thr Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His 
290 295 300 

Gly Ser Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu 
305 310 315 320 

Asp Gly Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val 
325 330 335 

Cys Asn Gly He Gly He Gly Glu Phe Lys Asp Ser Leu Ser He Asn 
340 345 350 

Ala Thr Asn He Lys His Phe Lys Asn Cys Thr Ser He Ser Gly Asp 
355 360 365 

Leu His He Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr 
370 375 380 

Pro Pro Leu Asp Pro Gin Glu Leu Asp He Leu Lys Thr Val Lys Glu 
385 390 395 400 

He Thr Gly Phe Leu Leu He Gin Ala Trp Pro Glu Asn Arg Thr Asp 
405 410 415 

Leu His Ala Phe Glu Asn Leu Glu He He Arg Gly Arg Thr Lys Gin 
420 425 430 



His Gly Gin Phe Ser Leu Ala Val Val Ser Leu Asn lie Thr Ser Leu 
435 440 445 



Gly Leu Arg Ser Leu Lys Glu lie Ser Asp Gly Asp Val lie lie Ser 
450 455 460 

Gly Asn Lys Asn Leu Cys Tyr Ala Asn Thr He Asn Trp Lys Lys Leu 
465 470 475 480 

Phe Gly Thr Ser Gly Gin Lys Thr Lys He He Ser Asn Arg Gly Glu 
485 490 495 

Asn Ser Cys Lys Ala Thr Gly Gin Val Cys His Ala Leu Cys Ser Pro 
500 505 510 

Glu Gly Cys Trp Gly Pro Glu Pro Arg Asp Cys Val Ser Cys Arg Asn 
515 520 525 

Val Ser Arg Gly Arg Glu Cys Val Asp Lys Cys Lys Leu Leu Glu Gly 
530 535 540 

Glu Pro Arg Glu Phe Val Glu Asn Ser Glu Cys He Gin Cys His Pro 
545 550 555 560 

Glu Cys Leu Pro Gin Ala Met Asn He Thr Cys Thr Gly Arg Gly Pro 
565 570 575 

Asp Asn Cys He Gin Cys Ala His Tyr He Asp Gly Pro His Cys Val 
580 585 590 

Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp 
595 600 605 

Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys His Pro Asn Cys 
610 615 620 

Thr Tyr Gly Pro Gly Asn Glu Ser Leu Lys Ala Met Leu Phe Cys Leu 
625 630 635 640 

Phe Lys Leu Ser Ser Cys Asn Gin Ser Asn Asp Gly Ser Val Ser His 
645 650 655 

Arg Ser Gly Ser Pro Ala Ala Gin Glu Ser Cys Leu Gly Trp He Pro 
660 665 670 

Ser Leu Leu Pro Ser Glu Phe Gin Leu Gly Trp Gly Gly Cys Ser His 
675 680 685 

Leu His Ala Trp Pro Ser Ala Ser Val He He Thr Ala Ser Ser Cys 
690 695 700 

His 
705 

<210> 5<211> 705<212> PRT<213> homo sapiens<400> 5 



Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala 



1 



5 



10 



15 



Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gin 
20 25 30 

Gly Thr Ser Asn Lys Leu Thr Gin Leu Gly Thr Phe Glu Asp His Phe 
35 40 45 

Leu Ser Leu Gin Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn 
50 55 60 

Leu Glu lie Thr Tyr Val Gin Arg Asn Tyr Asp Leu Ser Phe Leu Lys 
65 70 75 80 

Thr lie Gin Glu Val Ala Gly Tyr Val Leu lie Ala Leu Asn Thr Val 
85 90 95 

Glu Arg lie Pro Leu Glu Asn Leu Gin lie lie Arg Gly Asn Met Tyr 
100 105 110 

Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn 
115 120 125 

Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gin Glu lie Leu 
130 135 140 

His Gly Ala Val Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu 
145 150 155 160 

Ser lie Gin Trp Arg Asp lie Val Ser Ser Asp Phe Leu Ser Asn Met 
165 170 175 

Ser Met Asp Phe Gin Asn His Leu Gly Ser Cys Gin Lys Cys Asp Pro 
180 185 190 

Ser Cys Pro Asn Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gin 
195 200 205 

Lys Leu Thr Lys lie lie Cys Ala Gin Gin Cys Ser Gly Arg Cys Arg 
210 215 220 

Gly Lys Ser Pro Ser Asp Cys Cys His Asn Gin Cys Ala Ala Gly Cys 
225 230 235 240 

Thr Gly Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp 
245 250 255 

Glu Ala Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro 
260 265 270 

Thr Thr Tyr Gin Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly 
275 280 285 

Ala Thr Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His 
290 295 300 



Gly Ser Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu 



305 310 315 



320 



Asp Gly Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val 
325 330 335 

Cys Asn Gly He Gly He Gly Glu Phe Lys Asp Ser Leu Ser He Asn 
34 ° 345 350 

Ala Thr Asn He Lys His Phe Lys Asn Cys Thr Ser He Ser Gly Asp 
355 360 365 

Leu His He Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr 
370 375 380 

Pro Pro Leu Asp Pro Gin Glu Leu Asp He Leu Lys Thr Val Lys Glu 
385 3 ^0 395 400 

He Thr Gly Phe Leu Leu He Gin Ala Trp Pro Glu Asn Arg Thr Asp 
405 410 415 

Leu His Ala Phe Glu Asn Leu Glu He He Arg Gly Arg Thr Lys Gin 
420 425 430 

His Gly Gin Phe Ser Leu Ala Val Val Ser Leu Asn He Thr Ser Leu 
435 440 445 

Gly Leu Arg Ser Leu Lys Glu He Ser Asp Gly Asp Val He He Ser 
450 455 460 

Gly Asn Lys Asn Leu Cys Tyr Ala Asn Thr He Asn Trp Lys Lys Leu 
465 470 475 480 

Phe Gly Thr Ser Gly Gin Lys Thr Lys He He Ser Asn Arg Gly Glu 
485 490 495 

Asn Ser Cys Lys Ala Thr Gly Gin Val Cys His Ala Leu Cys Ser Pro 
500 505 510 

Glu Gly Cys Trp Gly Pro Glu Pro Arg Asp Cys Val Ser Cys Arg Asn 
515 520 525 

Val Ser Arg Gly Arg Glu Cys Val Asp Lys Cys Lys Leu Leu Glu Gly 
530 535 540 

Glu Pro Arg Glu Phe Val Glu Asn Ser Glu Cys He Gin Cys His Pro 
545 550 555 560 

Glu Cys Leu Pro Gin Ala Met Asn He Thr Cys Thr Gly Arg Gly Pro 
565 570 575 

Asp Asn Cys He Gin Cys Ala His Tyr He Asp Gly Pro His Cys Val 
580 585 590 

Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp 
595 600 605 

Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys His Pro Asn Cys 



61 ° 615 620 

Thr Tyr Gly Pro Gly Asn Glu Ser Leu Lys Ala Met Leu Phe Cys Leu 
625 ^30 635 640 

Phe Lys Leu Ser Ser Cys Asn Gin Ser Asn Asp Gly Ser Val Ser His 
645 650 655 

Gin Ser Gly Ser Leu Ala Ala Gin Glu Ser Cys Leu Gly Trp He Pro 
660 665 670 

Ser Leu Leu Pro Ser Glu Phe Gin Leu Gly Trp Gly Gly Cys Ser His 
^75 680 685 

Leu His Ala Trp Pro Ser Ala Ser Val He He Thr Ala Ser Ser Cys 
690 695 700 

His 
705 

<210> 6<211> 705<212> PRT<213> homo sapiens<400> 6 

Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala 
1 5 10 15 

Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gin 
20 25 30 

Gly Thr Ser Asn Lys Leu Thr Gin Leu Gly Thr Phe Glu Asp His Phe 
35 40 45 

Leu Ser Leu Gin Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn 
50 55 60 

Leu Glu He Thr Tyr Val Gin Arg Asn Tyr Asp Leu Ser Phe Leu Lys 
65 70 75 80 

Thr He Gin Glu Val Ala Gly Tyr Val Leu He Ala Leu Asn Thr Val 
85 90 95 

Glu Arg He Pro Leu Glu Asn Leu Gin He He Arg Gly Asn Met Tyr 
100 105 110 

Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn 
115 120 125 

Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gin Glu He Leu 
130 135 140 

His Gly Ala Val Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu 
145 150 155 160 

Ser He Gin Trp Arg Asp He Val Ser Ser Asp Phe Leu Ser Asn Met 
165 170 175 

Ser Met Asp Phe Gin Asn His Leu Gly Ser Cys Gin Lys Cys Asp Pro 
180 185 190 



Ser Cys Pro Asn Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gin 
19 5 200 205 

Lys Leu Thr Lys He He Cys Ala Gin Gin Cys Ser Gly Arg Cys Arg 
210 215 220 

Gly Lys Ser Pro Ser Asp Cys Cys His Asn Gin Cys Ala Ala Gly Cys 
225 230 235 240 

Thr Gly Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp 
245 250 255 

Glu Ala Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro 
260 265 270 

Thr Thr Tyr Gin Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly 
275 280 285 

Ala Thr Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His 
290 295 300 

Gly Ser Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu 
305 310 315 320 

Asp Gly Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val 
325 330 335 

Cys Asn Gly He Gly He Gly Glu Phe Lys Asp Ser Leu Ser He Asn 
340 345 350 

Ala Thr Asn He Lys His Phe Lys Asn Cys Thr Ser He Ser Gly Asp 
355 360 365 

Leu His He Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr 
370 375 380 

Pro Pro Leu Asp Pro Gin Glu Leu Asp He Leu Lys Thr Val Lys Glu 
385 390 395 400 

He Thr Gly Phe Leu Leu He Gin Ala Trp Pro Glu Asn Arg Thr Asp 
405 410 415 

Leu His Ala Phe Glu Asn Leu Glu He He Arg Gly Arg Thr Lys Gin 
420 425 430 

His Gly Gin Phe Ser Leu Ala Val Val Ser Leu Asn He Thr Ser Leu 
435 440 445 

Gly Leu Arg Ser Leu Lys Glu He Ser Asp Gly Asp Val He He Ser 
450 455 460 

Gly Asn Lys Asn Leu Cys Tyr Ala Asn Thr He Asn Trp Lys Lys Leu 
465 470 475 480 

Phe Gly Thr Ser Gly Gin Lys Thr Lys He He Ser Asn Arg Gly Glu 
485 490 495 



Asn Ser Cys Lys Ala Thr Gly Gin Val Cys His Ala Leu Cys Ser Pro 
500 505 510 

Glu Gly Cys Trp Gly Pro Glu Pro Arg Asp Cys Val Ser Cys Arg Asn 
515 520 525 

Val Ser Arg Gly Arg Glu Cys Val Asp Lys Cys Lys Leu Leu Glu Gly 
530 535 540 

Glu Pro Arg Glu Phe Val Glu Asn Ser Glu Cys He Gin Cys His Pro 
545 550 555 560 

Glu Cys Leu Pro Gin Ala Met Asn He Thr Cys Thr Gly Arg Gly Pro 
565 570 575 

Asp Asn Cys He Gin Cys Ala His Tyr He Asp Gly Pro His Cys Val 
580 585 590 

Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp 
595 600 605 

Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys His Pro Asn Cys 
610 615 620 

Thr Tyr Gly Pro Gly Asn Glu Ser Leu Lys Ala Met Leu Phe Cys Leu 
625 630 635 640 

Phe Lys Leu Ser Ser Cys Asn Gin Ser Asn Asp Gly Ser Val Ser His 
645 650 655 

Gin Ser Gly Ser Pro Ala Ala Gin Glu Ser Cys Leu Gly Trp He Pro 
660 665 670 

Ser Leu Leu Pro Ser Glu Phe Gin Leu Gly Trp Gly Gly Cys Ser His 
675 680 685 

Leu His Ala Trp Pro Ser Ala Ser Val He He Thr Ala Ser Phe Cys 
690 695 700 

His 
705 

<210> 7<211> 20<212> DNA<213> synthetic<4 0 0 > 7 

tcggggagca gcgatgcgac 20 
<210> 8<211> 20<212> DNA<213> synthetic<4 0 0> 8 

ccattcgttg gacagccttc 2 o 
<210> 9<211> 21<212> DMA<213> synthetic<400 > 9 

gtcatgggag aaaacaacac c 21 



<210> 10<211> 20<212> DNA<213> synthetic<400> 10 
agtaatgatg gcagcgtgtc 



20 



<210> 11<211> 20<212> DNA<213> synthetic<400> 11 

tcgctgccag atgattgttc 2 0 



<210> 12<211> 20<212> DNA<213> synthetic<400> 12 

cgctctggtg ggacacgctg 2 0 



<210> 13<211> 20<212> DNA<213> synthetic<400> 13 

gaaggaacca aggctagaag 2 0 



<210> 14<211> 17<212> PRT<213> synthetic<400> 14 

Leu Glu Asn Leu Gin lie lie Arg Gly Asn Met Tyr Tyr Glu Asn Ser 
15 10 15 

Tyr 



<210> 15<211> 23<212> PRT<213> synthetic<400> 15 

Asp Ser Tyr Glu Asn lie Glu Glu Asp Gly Val Arg Lys Cys Lys Lys 
15 10 15 

Cys Glu Gly Pro Cys Arg Lys 
20 

<210> 16<211> 18<212> PRT<213> synthetic<400> 16 

Phe Arg Gly Asp Ser Phe Thr His Thr Pro Pro Leu Asp Pro Gin Glu 
15 10 15 

Leu Asp 



<210> 17<211> 12<212> PRT<213> synthetic<400> 17 

lie Gin Cys Ala His Tyr lie Asp Gly Pro His Cys 
15 10 

<210> 18<211> 31<212> PRT<213> synthetic<400> 18 

Ser Val Ser His Gin Ser Gly Ser Pro Ala Ala Gin Glu Ser Cys Leu 
15 10 15 

Gly Trp lie Pro Ser Leu Leu Pro Ser Glu Phe Gin Leu Gly Trp 
20 25 30 

<210> 19<211> 24<212> PRT<213> synthetic<400> 19 

Gly Gly Cys Ser His Leu His Leu His Ala Trp Pro Ser Ala Ser Val 
15 10 15 

lie lie Thr Ala Ser Ser Cys His 



20 

<210> 20<211> 34<212> PRT<213> synthetic<400> 20 

Ala Asn Cys Thr Tyr Gly Cys Ala Gly Pro Gly Leu Gin Gly Cys Glu 
15 10 15 

Val Trp Pro Ser Gly Tyr Val Glu Trp Gin Trp lie Leu Lys Thr Phe 
20 25 30 

Trp lie 



